Dataset statistics
| Number of variables | 27 |
|---|---|
| Number of observations | 49247 |
| Missing cells | 2363 |
| Missing cells (%) | 0.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 10.1 MiB |
| Average record size in memory | 216.0 B |
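The percentages and sizes in this table can be re-derived from the raw counts. The sketch below does so under one labeled assumption: every cell is taken to occupy 8 bytes (a float64 value or an object pointer), which is not stated in the report but is consistent with the 216.0 B record size.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 27
n_observations = 49_247
missing_cells = 2_363

# Share of missing cells over the whole variables x observations grid.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption (not stated in the report): 8 bytes per cell.
bytes_per_cell = 8
avg_record_bytes = n_variables * bytes_per_cell
total_mib = avg_record_bytes * n_observations / 2**20

print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes} B")
print(f"total size: {total_mib:.1f} MiB")
```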
Variable types
| Numeric | 14 |
|---|---|
| Categorical | 13 |
Alerts
| Alert | Type |
|---|---|
| DIRNAME has a high cardinality: 6843 distinct values | High cardinality |
| CONAME has a high cardinality: 495 distinct values | High cardinality |
| CUSIP has a high cardinality: 495 distinct values | High cardinality |
| ADDRESS has a high cardinality: 492 distinct values | High cardinality |
| CITY has a high cardinality: 229 distinct values | High cardinality |
| ZIP has a high cardinality: 384 distinct values | High cardinality |
| SICDESC has a high cardinality: 178 distinct values | High cardinality |
| NAICSDESC has a high cardinality: 203 distinct values | High cardinality |
| INDDESC has a high cardinality: 122 distinct values | High cardinality |
| TICKER has a high cardinality: 495 distinct values | High cardinality |
| CASH_FEES is highly overall correlated with TOTAL_SEC | High correlation |
| STOCK_AWARDS is highly overall correlated with TOTAL_SEC | High correlation |
| TOTAL_SEC is highly overall correlated with CASH_FEES and 1 other fields | High correlation |
| SUB_TELE is highly overall correlated with STATE | High correlation |
| NAICS is highly overall correlated with SIC | High correlation |
| SIC is highly overall correlated with NAICS | High correlation |
| STATE is highly overall correlated with SUB_TELE | High correlation |
| SPCODE is highly imbalanced (96.5%) | Imbalance |
| STATE has 2082 (4.2%) missing values | Missing |
| STOCK_AWARDS is highly skewed (γ1 = 221.6263592) | Skewed |
| OPTION_AWARDS is highly skewed (γ1 = 48.18993794) | Skewed |
| NONEQ_INCENT is highly skewed (γ1 = 198.0277207) | Skewed |
| PENSION_CHG is highly skewed (γ1 = 58.1177729) | Skewed |
| OTHCOMP is highly skewed (γ1 = 146.2004733) | Skewed |
| TOTAL_SEC is highly skewed (γ1 = 221.0081778) | Skewed |
| CASH_FEES has 3378 (6.9%) zeros | Zeros |
| STOCK_AWARDS has 7479 (15.2%) zeros | Zeros |
| OPTION_AWARDS has 42377 (86.0%) zeros | Zeros |
| NONEQ_INCENT has 49088 (99.7%) zeros | Zeros |
| PENSION_CHG has 47593 (96.6%) zeros | Zeros |
| OTHCOMP has 31272 (63.5%) zeros | Zeros |
| TOTAL_SEC has 1693 (3.4%) zeros | Zeros |
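Each Zeros and Missing alert pairs a count with its share of the 49247 observations. As a quick consistency check, the following sketch recomputes those shares from the counts listed above; the dictionary name and labels are illustrative only.

```python
n_observations = 49_247

# Counts copied from the Zeros and Missing alerts.
flagged_counts = {
    "CASH_FEES zeros": 3378,
    "STOCK_AWARDS zeros": 7479,
    "OPTION_AWARDS zeros": 42377,
    "NONEQ_INCENT zeros": 49088,
    "PENSION_CHG zeros": 47593,
    "OTHCOMP zeros": 31272,
    "TOTAL_SEC zeros": 1693,
    "STATE missing": 2082,
}

# Share of all observations, formatted to one decimal like the report.
shares = {
    label: f"{100 * count / n_observations:.1f}%"
    for label, count in flagged_counts.items()
}

for label, share in shares.items():
    print(f"{label}: {share}")
```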
Reproduction
| Analysis started | 2023-05-08 04:15:34.250172 |
|---|---|
| Analysis finished | 2023-05-08 04:15:57.080767 |
| Duration | 22.83 seconds |
| Software version | ydata-profiling v4.0.0 |
| Download configuration | config.json |
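The duration is simply the difference between the two timestamps. A minimal sketch using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
started = datetime.fromisoformat("2023-05-08 04:15:34.250172")
finished = datetime.fromisoformat("2023-05-08 04:15:57.080767")

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```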
GVKEY
Real number (ℝ)
| Distinct | 495 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 42838.429 |
| Minimum | 1045 |
| Maximum | 316056 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 384.9 KiB |
Quantile statistics
| Minimum | 1045 |
|---|---|
| 5-th percentile | 1913 |
| Q1 | 5742 |
| median | 11228 |
| Q3 | 61483 |
| 95-th percentile | 171007 |
| Maximum | 316056 |
| Range | 315011 |
| Interquartile range (IQR) | 55741 |
Descriptive statistics
| Standard deviation | 59686.274 |
|---|---|
| Coefficient of variation (CV) | 1.3932881 |
| Kurtosis | 1.2466855 |
| Mean | 42838.429 |
| Median Absolute Deviation (MAD) | 8058 |
| Skewness | 1.5516093 |
| Sum | 2.1096641 × 10⁹ |
| Variance | 3.5624513 × 10⁹ |
| Monotonicity | Increasing |
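Several of the GVKEY figures are functions of one another: the range and IQR follow from the quantiles, and the coefficient of variation, variance, and sum follow from the mean, standard deviation, and observation count. The sketch below checks this using the rounded values printed in the report, so agreement is only expected to a few significant digits.

```python
import math

# Values copied from the GVKEY tables.
n_observations = 49_247
mean = 42838.429
std = 59686.274
minimum, q1, q3, maximum = 1045, 5742, 61483, 316056

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance
total = mean * n_observations     # Sum

print(value_range, iqr)
print(f"CV={cv:.7f} variance={variance:.7e} sum={total:.7e}")

# The reported figures are rounded, so compare with a tolerance.
assert math.isclose(variance, 3.5624513e9, rel_tol=1e-6)
assert math.isclose(total, 2.1096641e9, rel_tol=1e-6)
```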
Common Values
| Value | Count | Frequency (%) |
| 149070 | 264 | 0.5% |
| 11856 | 179 | 0.4% |
| 5047 | 170 | 0.3% |
| 8007 | 161 | 0.3% |
| 7647 | 157 | 0.3% |
| 8245 | 156 | 0.3% |
| 3144 | 154 | 0.3% |
| 3243 | 153 | 0.3% |
| 4723 | 153 | 0.3% |
| 184500 | 152 | 0.3% |
| Other values (485) | 47548 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 1045 | 127 | |
| 1075 | 117 | |
| 1078 | 117 | |
| 1161 | 101 | |
| 1209 | 101 | |
| 1230 | 100 | |
| 1300 | 112 | |
| 1327 | 79 | |
| 1380 | 128 | |
| 1440 | 127 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 316056 | 38 | 0.1% |
| 294524 | 113 | |
| 260774 | 108 | |
| 189491 | 87 | |
| 187697 | 58 | |
| 187450 | 58 | |
| 186989 | 99 | |
| 186310 | 82 | |
| 185532 | 68 | |
| 184996 | 74 |
DIRNBR
Real number (ℝ)
| Distinct | 32 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.9952078 |
| Minimum | 1 |
| Maximum | 32 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 384.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 6 |
| Q3 | 8 |
| 95-th percentile | 12 |
| Maximum | 32 |
| Range | 31 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.5664305 |
|---|---|
| Coefficient of variation (CV) | 0.59488021 |
| Kurtosis | 0.80657253 |
| Mean | 5.9952078 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.67740138 |
| Sum | 295246 |
| Variance | 12.719426 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 1 | 4792 | |
| 2 | 4790 | |
| 3 | 4788 | |
| 4 | 4776 | |
| 5 | 4733 | |
| 6 | 4631 | |
| 7 | 4459 | |
| 8 | 4139 | |
| 9 | 3627 | |
| 10 | 3005 | |
| Other values (22) | 5507 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 1 | 4792 | |
| 2 | 4790 | |
| 3 | 4788 | |
| 4 | 4776 | |
| 5 | 4733 | |
| 6 | 4631 | |
| 7 | 4459 | |
| 8 | 4139 | |
| 9 | 3627 | |
| 10 | 3005 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 32 | 1 | < 0.1% |
| 31 | 2 | < 0.1% |
| 30 | 3 | < 0.1% |
| 29 | 4 | < 0.1% |
| 28 | 4 | < 0.1% |
| 27 | 5 | |
| 26 | 5 | |
| 25 | 7 | |
| 24 | 10 | |
| 23 | 11 |
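Many Frequency (%) cells in the DIRNBR tables are blank, but each one can be recovered as the count divided by the 49247 observations. The sketch below does this for the ten most common values and confirms that the remaining rows add up to the "Other values" count.

```python
n_observations = 49_247

# Counts of the ten most common DIRNBR values, copied from the table.
top_counts = {
    1: 4792, 2: 4790, 3: 4788, 4: 4776, 5: 4733,
    6: 4631, 7: 4459, 8: 4139, 9: 3627, 10: 3005,
}

# Frequency of each value as a percentage of all observations.
frequency_pct = {
    value: 100 * count / n_observations
    for value, count in top_counts.items()
}

# Everything not in the top ten falls under "Other values".
other_count = n_observations - sum(top_counts.values())

for value, pct in frequency_pct.items():
    print(f"{value}: {pct:.1f}%")
print(f"Other values: {other_count}")
```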
DIRNAME
Categorical
| Distinct | 6843 |
|---|---|
| Distinct (%) | 13.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 384.9 KiB |
| Shirley Ann Jackson | 45 |
|---|---|
| Michael L. Eskew | 41 |
| Alexis M. Herman | 40 |
| Suzanne M. Nora Johnson | 40 |
| Edward M. Liddy, M.B.A. | 38 |
| Other values (6838) | 49043 |
Length
| Max length | 78 |
|---|---|
| Median length | 67 |
| Mean length | 19.037545 |
| Min length | 7 |
Characters and Unicode
| Total characters | 937542 |
|---|---|
| Distinct characters | 60 |
| Distinct categories | 7 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 657 |
|---|---|
| Unique (%) | 1.3% |
Sample
| 1st row | Roger T. Staubach |
|---|---|
| 2nd row | Ann McLaughlin Korologos |
| 3rd row | Judith Rodin, Ph.D. |
| 4th row | David L. Boren |
| 5th row | Ray M. Robinson, Jr. |
Common Values
| Value | Count | Frequency (%) |
| Shirley Ann Jackson | 45 | 0.1% |
| Michael L. Eskew | 41 | 0.1% |
| Alexis M. Herman | 40 | 0.1% |
| Suzanne M. Nora Johnson | 40 | 0.1% |
| Edward M. Liddy, M.B.A. | 38 | 0.1% |
| Roxanne S. Austin | 38 | 0.1% |
| Patricia F. Russo | 38 | 0.1% |
| Steven S. Reinemund | 36 | 0.1% |
| Richard H. Lenny | 35 | 0.1% |
| Susan C. Schwab | 35 | 0.1% |
| Other values (6833) | 48861 |
Words
| Value | Count | Frequency (%) |
| j | 3732 | 2.3% |
| a | 3641 | 2.3% |
| m | 2948 | 1.8% |
| jr | 2799 | 1.7% |
| john | 2597 | 1.6% |
| l | 2457 | 1.5% |
| r | 2140 | 1.3% |
| ph.d | 2084 | 1.3% |
| c | 2032 | 1.3% |
| robert | 2020 | 1.3% |
| Other values (6918) | 134816 |
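The tokens in the table above ("j", "jr", "ph.d") suggest that names are lowercased, split on whitespace, and stripped of leading and trailing punctuation. That rule is an assumption inferred from the output, not something the report states. A minimal sketch of such a tokenizer, applied to the five sample rows:

```python
import string
from collections import Counter

# The five sample rows listed for DIRNAME.
names = [
    "Roger T. Staubach",
    "Ann McLaughlin Korologos",
    "Judith Rodin, Ph.D.",
    "David L. Boren",
    "Ray M. Robinson, Jr.",
]

STRIP_CHARS = string.punctuation + string.whitespace


def tokenize(text):
    """Lowercase, split on whitespace, trim punctuation at both ends.

    Hypothetical rule, chosen because it reproduces tokens such as
    "ph.d" (inner dot kept) and "jr" (trailing dot removed).
    """
    tokens = (word.strip(STRIP_CHARS) for word in text.lower().split())
    return [token for token in tokens if token]


word_counts = Counter(token for name in names for token in tokenize(name))
print(tokenize("Judith Rodin, Ph.D."))
print(word_counts.most_common(3))
```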
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 112019 | 11.9% |
| e | 70978 | 7.6% |
| a | 64422 | 6.9% |
| r | 56194 | 6.0% |
| n | 52714 | 5.6% |
| . | 52582 | 5.6% |
| o | 42375 | 4.5% |
| i | 41959 | 4.5% |
| l | 39764 | 4.2% |
| h | 27544 | 2.9% |
| Other values (50) | 376991 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 572645 | |
| Uppercase Letter | 183084 | 19.5% |
| Space Separator | 112019 | 11.9% |
| Other Punctuation | 68310 | 7.3% |
| Dash Punctuation | 760 | 0.1% |
| Open Punctuation | 363 | < 0.1% |
| Close Punctuation | 361 | < 0.1% |
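The category names in this table correspond to Unicode general categories (Ll, Lu, Zs, Po, Pd, Ps, Pe). As an illustration, the sketch below classifies the characters of one sample name with the standard `unicodedata` module and checks that the reported category counts add up to the total character count.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters by two-letter Unicode general category."""
    return Counter(unicodedata.category(ch) for ch in text)


# One of the DIRNAME sample rows.
sample_counts = category_counts("Judith Rodin, Ph.D.")
print(dict(sample_counts))

# Category totals copied from the table above.
reported = {
    "Lowercase Letter": 572645,
    "Uppercase Letter": 183084,
    "Space Separator": 112019,
    "Other Punctuation": 68310,
    "Dash Punctuation": 760,
    "Open Punctuation": 363,
    "Close Punctuation": 361,
}
total_characters = 937_542

# The categories partition the characters, so they sum to the total.
assert sum(reported.values()) == total_characters
```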
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 70978 | |
| a | 64422 | |
| r | 56194 | |
| n | 52714 | |
| o | 42375 | 7.4% |
| i | 41959 | 7.3% |
| l | 39764 | 6.9% |
| h | 27544 | 4.8% |
| s | 27171 | 4.7% |
| t | 25821 | 4.5% |
| Other values (16) | 123703 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 17559 | 9.6% |
| M | 16043 | 8.8% |
| A | 14055 | 7.7% |
| C | 13318 | 7.3% |
| D | 13038 | 7.1% |
| S | 12272 | 6.7% |
| B | 11109 | 6.1% |
| R | 10887 | 5.9% |
| P | 10613 | 5.8% |
| L | 8442 | 4.6% |
| Other values (16) | 55748 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 52582 | |
| , | 15496 | 22.7% |
| ' | 222 | 0.3% |
| / | 10 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 112019 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 760 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 363 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 361 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 755729 | |
| Common | 181813 | 19.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 70978 | 9.4% |
| a | 64422 | 8.5% |
| r | 56194 | 7.4% |
| n | 52714 | 7.0% |
| o | 42375 | 5.6% |
| i | 41959 | 5.6% |
| l | 39764 | 5.3% |
| h | 27544 | 3.6% |
| s | 27171 | 3.6% |
| t | 25821 | 3.4% |
| Other values (42) | 306787 |
Common
| Value | Count | Frequency (%) |
| (space) | 112019 | |
| . | 52582 | |
| , | 15496 | 8.5% |
| - | 760 | 0.4% |
| ( | 363 | 0.2% |
| ) | 361 | 0.2% |
| ' | 222 | 0.1% |
| / | 10 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 937542 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 112019 | 11.9% |
| e | 70978 | 7.6% |
| a | 64422 | 6.9% |
| r | 56194 | 6.0% |
| n | 52714 | 5.6% |
| . | 52582 | 5.6% |
| o | 42375 | 4.5% |
| i | 41959 | 4.5% |
| l | 39764 | 4.2% |
| h | 27544 | 2.9% |
| Other values (50) | 376991 |
CASH_FEES
Real number (ℝ)
Alerts: High correlation, Zeros
| Distinct | 8754 |
|---|---|
| Distinct (%) | 17.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 97.297325 |
| Minimum | 0 |
| Maximum | 4100.385 |
| Zeros | 3378 |
| Zeros (%) | 6.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 384.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 65.5 |
| median | 98 |
| Q3 | 122.283 |
| 95-th percentile | 180.8294 |
| Maximum | 4100.385 |
| Range | 4100.385 |
| Interquartile range (IQR) | 56.783 |
Descriptive statistics
| Standard deviation | 71.03079 |
|---|---|
| Coefficient of variation (CV) | 0.73003847 |
| Kurtosis | 490.96227 |
| Mean | 97.297325 |
| Median Absolute Deviation (MAD) | 27 |
| Skewness | 12.643894 |
| Sum | 4791601.3 |
| Variance | 5045.3731 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3378 | 6.9% |
| 100 | 1973 | 4.0% |
| 110 | 1253 | 2.5% |
| 120 | 1139 | 2.3% |
| 90 | 1116 | 2.3% |
| 115 | 1030 | 2.1% |
| 75 | 1028 | 2.1% |
| 125 | 943 | 1.9% |
| 80 | 758 | 1.5% |
| 85 | 725 | 1.5% |
| Other values (8744) | 35904 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 3378 | |
| 0.007 | 2 | < 0.1% |
| 0.015 | 3 | < 0.1% |
| 0.021 | 2 | < 0.1% |
| 0.023 | 1 | < 0.1% |
| 0.03 | 1 | < 0.1% |
| 0.031 | 1 | < 0.1% |
| 0.037 | 2 | < 0.1% |
| 0.04 | 2 | < 0.1% |
| 0.061 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 4100.385 | 1 | < 0.1% |
| 3450 | 1 | < 0.1% |
| 3275.385 | 1 | < 0.1% |
| 2451.236 | 1 | < 0.1% |
| 2395 | 1 | < 0.1% |
| 1800 | 3 | |
| 1625 | 2 | |
| 1575 | 4 | |
| 1250 | 3 | |
| 1000 | 4 |
STOCK_AWARDS
Real number (ℝ)
Alerts: High correlation, Skewed, Zeros
| Distinct | 7540 |
|---|---|
| Distinct (%) | 15.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 170.82242 |
| Minimum | 0 |
| Maximum | 1927510.7 |
| Zeros | 7479 |
| Zeros (%) | 15.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 384.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 75.264 |
| median | 129.376 |
| Q3 | 170.52 |
| 95-th percentile | 265.629 |
| Maximum | 1927510.7 |
| Range | 1927510.7 |
| Interquartile range (IQR) | 95.256 |
Descriptive statistics
| Standard deviation | 8688.9804 |
|---|---|
| Coefficient of variation (CV) | 50.865574 |
| Kurtosis | 49160.19 |
| Mean | 170.82242 |
| Median Absolute Deviation (MAD) | 45.624 |
| Skewness | 221.62636 |
| Sum | 8412491.7 |
| Variance | 75498380 |
| Monotonicity | Not monotonic |
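The skewness figure (γ1) measures how asymmetric the distribution is. The report does not say which estimator it uses; the sketch below assumes the uncorrected moment form g1 = m3 / m2^(3/2), where m2 and m3 are the second and third central moments. Bias-corrected variants, such as the one pandas applies in `Series.skew`, give slightly different numbers on small samples.

```python
def skewness(values):
    """Moment-based sample skewness g1 = m3 / m2**1.5 (no bias correction)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n  # second central moment
    m3 = sum((x - mean) ** 3 for x in values) / n  # third central moment
    return m3 / m2 ** 1.5


# Toy data, not taken from the dataset: mostly zeros plus one large value
# produces a long right tail and therefore positive skewness.
right_tailed = [0, 0, 0, 0, 10]
symmetric = [1, 2, 3]

print(skewness(right_tailed))
print(skewness(symmetric))
```

A column such as STOCK_AWARDS, with many zeros and a handful of very large values, has the same shape as the toy right-tailed sample, only far more extreme.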
Common Values
| Value | Count | Frequency (%) |
| 0 | 7479 | 15.2% |
| 150 | 984 | 2.0% |
| 100 | 612 | 1.2% |
| 125 | 582 | 1.2% |
| 160 | 526 | 1.1% |
| 120 | 499 | 1.0% |
| 140 | 467 | 0.9% |
| 175 | 462 | 0.9% |
| 130 | 453 | 0.9% |
| 110 | 313 | 0.6% |
| Other values (7530) | 36870 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 7479 | |
| 0.311 | 1 | < 0.1% |
| 0.512 | 1 | < 0.1% |
| 0.598 | 1 | < 0.1% |
| 0.715 | 1 | < 0.1% |
| 0.878 | 1 | < 0.1% |
| 1.272 | 2 | < 0.1% |
| 1.444 | 1 | < 0.1% |
| 1.712 | 1 | < 0.1% |
| 2.377 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 1927510.711 | 1 | |
| 43560 | 1 | |
| 27516.225 | 1 | |
| 7397.669 | 1 | |
| 6081.243 | 1 | |
| 4333.238 | 1 | |
| 3437.426 | 1 | |
| 3249.972 | 1 | |
| 3249.971 | 1 | |
| 3249.961 | 1 |
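The largest STOCK_AWARDS value (1927510.711) is more than forty times the next one, so it has an outsized effect on the mean. The sketch below estimates that effect from the reported sum and count; because the sum is printed with limited precision, the results are approximate.

```python
# Values copied from the STOCK_AWARDS tables.
n_observations = 49_247
total = 8412491.7
largest = 1927510.711

# Share of the column total contributed by the single largest record.
share_of_total = largest / total

# Mean with and without that record.
mean_all = total / n_observations
mean_without_largest = (total - largest) / (n_observations - 1)

print(f"share of total: {share_of_total:.1%}")
print(f"mean (all rows): {mean_all:.2f}")
print(f"mean (largest row dropped): {mean_without_largest:.2f}")
```

Dropping the one record brings the mean close to the reported median of 129.376, which suggests the extreme skewness is driven largely by that single observation.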
OPTION_AWARDS
Real number (ℝ)
Alerts: Skewed, Zeros
| Distinct | 1579 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.872265 |
| Minimum | 0 |
| Maximum | 23098.558 |
| Zeros | 42377 |
| Zeros (%) | 86.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 384.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 99.986 |
| Maximum | 23098.558 |
| Range | 23098.558 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 263.91548 |
|---|---|
| Coefficient of variation (CV) | 11.055318 |
| Kurtosis | 3053.2969 |
| Mean | 23.872265 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 48.189938 |
| Sum | 1175637.4 |
| Variance | 69651.38 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 0 | 42377 | |
| 50 | 73 | 0.1% |
| 65 | 69 | 0.1% |
| 70 | 57 | 0.1% |
| 75 | 45 | 0.1% |
| 110 | 33 | 0.1% |
| 120 | 32 | 0.1% |
| 100 | 32 | 0.1% |
| 23.996 | 31 | 0.1% |
| 50.001 | 30 | 0.1% |
| Other values (1569) | 6468 | 13.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 42377 | |
| 0.27 | 1 | < 0.1% |
| 0.337 | 1 | < 0.1% |
| 0.394 | 1 | < 0.1% |
| 0.436 | 1 | < 0.1% |
| 0.44 | 1 | < 0.1% |
| 0.571 | 1 | < 0.1% |
| 0.621 | 1 | < 0.1% |
| 0.711 | 1 | < 0.1% |
| 0.763 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 23098.558 | 1 | |
| 20410.945 | 1 | |
| 17677.5 | 1 | |
| 13330.035 | 1 | |
| 13286.158 | 1 | |
| 13136.634 | 1 | |
| 11716.441 | 1 | |
| 9872.745 | 1 | |
| 9753.005 | 1 | |
| 9005.547 | 1 |
NONEQ_INCENT
Real number (ℝ)
Alerts: Skewed, Zeros
| Distinct | 57 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.36994229 |
| Minimum | 0 |
| Maximum | 6926.502 |
| Zeros | 49088 |
| Zeros (%) | 99.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 384.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 6926.502 |
| Range | 6926.502 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 32.526511 |
|---|---|
| Coefficient of variation (CV) | 87.923202 |
| Kurtosis | 41814.683 |
| Mean | 0.36994229 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 198.02772 |
| Sum | 18218.548 |
| Variance | 1057.9739 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 0 | 49088 | |
| 275 | 11 | < 0.1% |
| 24.3 | 11 | < 0.1% |
| 27.75 | 10 | < 0.1% |
| 19.8 | 9 | < 0.1% |
| 28.2 | 9 | < 0.1% |
| 17.85 | 9 | < 0.1% |
| 28.35 | 9 | < 0.1% |
| 30 | 9 | < 0.1% |
| 18.75 | 8 | < 0.1% |
| Other values (47) | 74 | 0.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 49088 | |
| 0.15 | 2 | < 0.1% |
| 0.189 | 1 | < 0.1% |
| 2.781 | 1 | < 0.1% |
| 3.026 | 1 | < 0.1% |
| 3.743 | 1 | < 0.1% |
| 4.442 | 2 | < 0.1% |
| 4.594 | 1 | < 0.1% |
| 5.076 | 1 | < 0.1% |
| 5.575 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 6926.502 | 1 | < 0.1% |
| 1283.392 | 1 | < 0.1% |
| 900.154 | 1 | < 0.1% |
| 458.341 | 1 | < 0.1% |
| 432.6 | 1 | < 0.1% |
| 343.962 | 1 | < 0.1% |
| 275 | 11 | |
| 200 | 1 | < 0.1% |
| 166.146 | 3 | < 0.1% |
| 132.376 | 1 | < 0.1% |
PENSION_CHG
Real number (ℝ)
Alerts: Skewed, Zeros
| Distinct | 1561 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0544507 |
| Minimum | -805.309 |
| Maximum | 2420 |
| Zeros | 47593 |
| Zeros (%) | 96.6% |
| Negative | 14 |
| Negative (%) | < 0.1% |
| Memory size | 384.9 KiB |
Quantile statistics
| Minimum | -805.309 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 2420 |
| Range | 3225.309 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 19.820273 |
|---|---|
| Coefficient of variation (CV) | 18.796775 |
| Kurtosis | 5597.0123 |
| Mean | 1.0544507 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 58.117773 |
| Sum | 51928.536 |
| Variance | 392.84323 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 0 | 47593 | |
| 0.003 | 6 | < 0.1% |
| 0.002 | 5 | < 0.1% |
| 0.005 | 5 | < 0.1% |
| 0.019 | 3 | < 0.1% |
| 0.883 | 3 | < 0.1% |
| 0.035 | 3 | < 0.1% |
| 1.436 | 3 | < 0.1% |
| 14.797 | 3 | < 0.1% |
| 0.762 | 3 | < 0.1% |
| Other values (1551) | 1620 | 3.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
| -805.309 | 1 | |
| -80.528 | 1 | |
| -50.396 | 1 | |
| -31.567 | 1 | |
| -28.603 | 1 | |
| -25.922 | 1 | |
| -20.189 | 1 | |
| -13.348 | 1 | |
| -8.872 | 1 | |
| -7.633 | 1 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 2420 | 1 | |
| 1254.329 | 1 | |
| 1060.724 | 1 | |
| 1045.873 | 1 | |
| 969.203 | 1 | |
| 913.982 | 1 | |
| 775.329 | 1 | |
| 680.55 | 1 | |
| 627.545 | 1 | |
| 621.999 | 1 |
OTHCOMP
Real number (ℝ)
Alerts: Skewed, Zeros
| Distinct | 8240 |
|---|---|
| Distinct (%) | 16.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.793034 |
| Minimum | -571.2 |
| Maximum | 47502.388 |
| Zeros | 31272 |
| Zeros (%) | 63.5% |
| Negative | 7 |
| Negative (%) | < 0.1% |
| Memory size | 384.9 KiB |
Quantile statistics
| Minimum | -571.2 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 4.6945 |
| 95-th percentile | 38.1037 |
| Maximum | 47502.388 |
| Range | 48073.588 |
| Interquartile range (IQR) | 4.6945 |
Descriptive statistics
| Standard deviation | 249.2467 |
|---|---|
| Coefficient of variation (CV) | 18.070476 |
| Kurtosis | 26937.585 |
| Mean | 13.793034 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 146.20047 |
| Sum | 679265.56 |
| Variance | 62123.917 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 0 | 31272 | |
| 10 | 1283 | 2.6% |
| 5 | 730 | 1.5% |
| 20 | 399 | 0.8% |
| 15 | 365 | 0.7% |
| 7.5 | 212 | 0.4% |
| 25 | 196 | 0.4% |
| 2.5 | 144 | 0.3% |
| 50 | 137 | 0.3% |
| 30 | 133 | 0.3% |
| Other values (8230) | 14376 |
Minimum 10 values
| Value | Count | Frequency (%) |
| -571.2 | 1 | < 0.1% |
| -252 | 1 | < 0.1% |
| -44.836 | 1 | < 0.1% |
| -41.801 | 1 | < 0.1% |
| -41.548 | 1 | < 0.1% |
| -41.092 | 1 | < 0.1% |
| -26.392 | 1 | < 0.1% |
| 0 | 31272 | |
| 0.001 | 9 | < 0.1% |
| 0.002 | 2 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 47502.388 | 1 | |
| 10868.464 | 1 | |
| 9500 | 1 | |
| 7688.89 | 1 | |
| 7566.196 | 1 | |
| 5265.101 | 1 | |
| 5071.726 | 1 | |
| 5000 | 1 | |
| 4673.611 | 1 | |
| 4647.183 | 1 |
TOTAL_SEC
Real number (ℝ)
Alerts: High correlation, Skewed, Zeros
| Distinct | 31751 |
|---|---|
| Distinct (%) | 64.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 306.38494 |
| Minimum | -0.002 |
| Maximum | 1927510.7 |
| Zeros | 1693 |
| Zeros (%) | 3.4% |
| Negative | 1 |
| Negative (%) | < 0.1% |
| Memory size | 384.9 KiB |
Quantile statistics
| Minimum | -0.002 |
|---|---|
| 5-th percentile | 25.5501 |
| Q1 | 199.673 |
| median | 253.147 |
| Q3 | 302.88 |
| 95-th percentile | 437.2434 |
| Maximum | 1927510.7 |
| Range | 1927510.7 |
| Interquartile range (IQR) | 103.207 |
Descriptive statistics
| Standard deviation | 8696.5284 |
|---|---|
| Coefficient of variation (CV) | 28.384321 |
| Kurtosis | 48975.969 |
| Mean | 306.38494 |
| Median Absolute Deviation (MAD) | 52.145 |
| Skewness | 221.00818 |
| Sum | 15088539 |
| Variance | 75629606 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1693 | 3.4% |
| 250 | 177 | 0.4% |
| 240 | 166 | 0.3% |
| 300 | 155 | 0.3% |
| 260 | 154 | 0.3% |
| 200 | 142 | 0.3% |
| 275 | 138 | 0.3% |
| 220 | 134 | 0.3% |
| 280 | 126 | 0.3% |
| 270 | 125 | 0.3% |
| Other values (31741) | 46237 |
Minimum 10 values
| Value | Count | Frequency (%) |
| -0.002 | 1 | < 0.1% |
| 0 | 1693 | |
| 0.001 | 2 | < 0.1% |
| 0.023 | 1 | < 0.1% |
| 0.259 | 1 | < 0.1% |
| 0.299 | 1 | < 0.1% |
| 0.5 | 1 | < 0.1% |
| 0.513 | 1 | < 0.1% |
| 0.53 | 1 | < 0.1% |
| 0.563 | 3 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 1927510.712 | 1 | |
| 47502.388 | 1 | |
| 43682.359 | 1 | |
| 30805.26 | 1 | |
| 23207.558 | 1 | |
| 20514.945 | 1 | |
| 17735.283 | 1 | |
| 14832.532 | 1 | |
| 13430.035 | 1 | |
| 13323.658 | 1 |
YEAR
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2014.6417 |
| Minimum | 2010 |
| Maximum | 2019 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 384.9 KiB |
Quantile statistics
| Minimum | 2010 |
|---|---|
| 5-th percentile | 2010 |
| Q1 | 2012 |
| median | 2015 |
| Q3 | 2017 |
| 95-th percentile | 2019 |
| Maximum | 2019 |
| Range | 9 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.8685722 |
|---|---|
| Coefficient of variation (CV) | 0.0014238622 |
| Kurtosis | -1.2164314 |
| Mean | 2014.6417 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.060738892 |
| Sum | 99215060 |
| Variance | 8.2287063 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 2019 | 5310 | |
| 2018 | 5232 | |
| 2016 | 5096 | |
| 2017 | 5068 | |
| 2015 | 5000 | |
| 2014 | 4868 | |
| 2013 | 4812 | |
| 2012 | 4676 | |
| 2011 | 4632 | |
| 2010 | 4553 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 2010 | 4553 | |
| 2011 | 4632 | |
| 2012 | 4676 | |
| 2013 | 4812 | |
| 2014 | 4868 | |
| 2015 | 5000 | |
| 2016 | 5096 | |
| 2017 | 5068 | |
| 2018 | 5232 | |
| 2019 | 5310 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 2019 | 5310 | |
| 2018 | 5232 | |
| 2017 | 5068 | |
| 2016 | 5096 | |
| 2015 | 5000 | |
| 2014 | 4868 | |
| 2013 | 4812 | |
| 2012 | 4676 | |
| 2011 | 4632 | |
| 2010 | 4553 |
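Because YEAR takes only ten distinct values, its summary statistics can be rebuilt exactly from the per-year counts. The sketch below recomputes the observation count, sum, and mean from the frequency table and derives the percentages that the Frequency (%) column leaves blank.

```python
# Per-year record counts copied from the YEAR frequency table.
year_counts = {
    2010: 4553, 2011: 4632, 2012: 4676, 2013: 4812, 2014: 4868,
    2015: 5000, 2016: 5096, 2017: 5068, 2018: 5232, 2019: 5310,
}

n_observations = sum(year_counts.values())
year_sum = sum(year * count for year, count in year_counts.items())
mean_year = year_sum / n_observations

# Share of records per year, as a percentage of all observations.
year_pct = {
    year: 100 * count / n_observations
    for year, count in year_counts.items()
}

print(n_observations, year_sum, f"{mean_year:.4f}")
for year, pct in year_pct.items():
    print(f"{year}: {pct:.1f}%")
```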
CONAME
Categorical
| Distinct | 495 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 384.9 KiB |
| CME GROUP INC | 264 |
|---|---|
| TRUIST FINANCIAL CORP | 179 |
| GENERAL ELECTRIC CO | 170 |
| WELLS FARGO & CO | 161 |
| BANK OF AMERICA CORP | 157 |
| Other values (490) | 48316 |
Length
| Max length | 28 |
|---|---|
| Median length | 20 |
| Mean length | 17.177757 |
| Min length | 5 |
Characters and Unicode
| Total characters | 845953 |
|---|---|
| Distinct characters | 37 |
| Distinct categories | 7 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AMERICAN AIRLINES GROUP INC |
|---|---|
| 2nd row | AMERICAN AIRLINES GROUP INC |
| 3rd row | AMERICAN AIRLINES GROUP INC |
| 4th row | AMERICAN AIRLINES GROUP INC |
| 5th row | AMERICAN AIRLINES GROUP INC |
Common Values
| Value | Count | Frequency (%) |
| CME GROUP INC | 264 | 0.5% |
| TRUIST FINANCIAL CORP | 179 | 0.4% |
| GENERAL ELECTRIC CO | 170 | 0.3% |
| WELLS FARGO & CO | 161 | 0.3% |
| BANK OF AMERICA CORP | 157 | 0.3% |
| PNC FINANCIAL SVCS GROUP INC | 156 | 0.3% |
| COCA-COLA CO | 154 | 0.3% |
| CITIGROUP INC | 153 | 0.3% |
| US BANCORP | 153 | 0.3% |
| CBOE GLOBAL MARKETS INC | 152 | 0.3% |
| Other values (485) | 47548 |
Words
| Value | Count | Frequency (%) |
| inc | 24191 | 17.4% |
| corp | 11521 | 8.3% |
| co | 5348 | 3.9% |
| group | 2676 | 1.9% |
|  | 2364 | 1.7% |
| energy | 1886 | 1.4% |
| financial | 1650 | 1.2% |
| plc | 1448 | 1.0% |
| technologies | 1119 | 0.8% |
| holdings | 1054 | 0.8% |
| Other values (699) | 85410 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 89696 | 10.6% |
| C | 75849 | 9.0% |
| N | 74174 | 8.8% |
| I | 68352 | 8.1% |
| E | 65656 | 7.8% |
| O | 63991 | 7.6% |
| R | 61254 | 7.2% |
| A | 52302 | 6.2% |
| S | 40454 | 4.8% |
| T | 40376 | 4.8% |
| Other values (27) | 213849 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 747222 | |
| Space Separator | 89696 | 10.6% |
| Other Punctuation | 3799 | 0.4% |
| Dash Punctuation | 1935 | 0.2% |
| Close Punctuation | 1433 | 0.2% |
| Open Punctuation | 1433 | 0.2% |
| Decimal Number | 435 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 75849 | |
| N | 74174 | |
| I | 68352 | |
| E | 65656 | 8.8% |
| O | 63991 | 8.6% |
| R | 61254 | 8.2% |
| A | 52302 | 7.0% |
| S | 40454 | 5.4% |
| T | 40376 | 5.4% |
| L | 38277 | 5.1% |
| Other values (16) | 166537 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 2748 | |
| ' | 484 | 12.7% |
| . | 392 | 10.3% |
| / | 175 | 4.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 226 | |
| 6 | 128 | |
| 5 | 81 | 18.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 89696 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1935 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1433 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1433 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 747222 | |
| Common | 98731 | 11.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 75849 | |
| N | 74174 | |
| I | 68352 | |
| E | 65656 | 8.8% |
| O | 63991 | 8.6% |
| R | 61254 | 8.2% |
| A | 52302 | 7.0% |
| S | 40454 | 5.4% |
| T | 40376 | 5.4% |
| L | 38277 | 5.1% |
| Other values (16) | 166537 |
Common
| Value | Count | Frequency (%) |
| (space) | 89696 | |
| & | 2748 | 2.8% |
| - | 1935 | 2.0% |
| ) | 1433 | 1.5% |
| ( | 1433 | 1.5% |
| ' | 484 | 0.5% |
| . | 392 | 0.4% |
| 3 | 226 | 0.2% |
| / | 175 | 0.2% |
| 6 | 128 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 845953 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 89696 | 10.6% |
| C | 75849 | 9.0% |
| N | 74174 | 8.8% |
| I | 68352 | 8.1% |
| E | 65656 | 7.8% |
| O | 63991 | 7.6% |
| R | 61254 | 7.2% |
| A | 52302 | 6.2% |
| S | 40454 | 4.8% |
| T | 40376 | 4.8% |
| Other values (27) | 213849 |
CUSIP
Categorical
| Distinct | 495 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 384.9 KiB |
| 12572Q10 | 264 |
|---|---|
| 89832Q10 | 179 |
| 36960430 | 170 |
| 94974610 | 161 |
| 06050510 | 157 |
| Other values (490) | 48316 |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 393976 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 02376R10 |
|---|---|
| 2nd row | 02376R10 |
| 3rd row | 02376R10 |
| 4th row | 02376R10 |
| 5th row | 02376R10 |
Common Values
| Value | Count | Frequency (%) |
| 12572Q10 | 264 | 0.5% |
| 89832Q10 | 179 | 0.4% |
| 36960430 | 170 | 0.3% |
| 94974610 | 161 | 0.3% |
| 06050510 | 157 | 0.3% |
| 69347510 | 156 | 0.3% |
| 19121610 | 154 | 0.3% |
| 17296742 | 153 | 0.3% |
| 90297330 | 153 | 0.3% |
| 12503M10 | 152 | 0.3% |
| Other values (485) | 47548 |
Words
| Value | Count | Frequency (%) |
| 12572q10 | 264 | 0.5% |
| 89832q10 | 179 | 0.4% |
| 36960430 | 170 | 0.3% |
| 94974610 | 161 | 0.3% |
| 06050510 | 157 | 0.3% |
| 69347510 | 156 | 0.3% |
| 19121610 | 154 | 0.3% |
| 17296742 | 153 | 0.3% |
| 90297330 | 153 | 0.3% |
| 12503m10 | 152 | 0.3% |
| Other values (485) | 47548 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 80954 | |
| 1 | 75076 | |
| 4 | 30261 | 7.7% |
| 2 | 29169 | 7.4% |
| 6 | 28519 | 7.2% |
| 3 | 28498 | 7.2% |
| 5 | 27958 | 7.1% |
| 7 | 25370 | 6.4% |
| 8 | 23851 | 6.1% |
| 9 | 22850 | 5.8% |
| Other values (23) | 21470 | 5.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 372506 | |
| Uppercase Letter | 21470 | 5.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 2671 | 12.4% |
| L | 1577 | 7.3% |
| R | 1305 | 6.1% |
| C | 1272 | 5.9% |
| V | 1266 | 5.9% |
| E | 1187 | 5.5% |
| P | 1172 | 5.5% |
| T | 1131 | 5.3% |
| H | 1090 | 5.1% |
| Q | 1068 | 5.0% |
| Other values (13) | 7731 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 80954 | |
| 1 | 75076 | |
| 4 | 30261 | 8.1% |
| 2 | 29169 | 7.8% |
| 6 | 28519 | 7.7% |
| 3 | 28498 | 7.7% |
| 5 | 27958 | 7.5% |
| 7 | 25370 | 6.8% |
| 8 | 23851 | 6.4% |
| 9 | 22850 | 6.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 372506 | |
| Latin | 21470 | 5.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 2671 | 12.4% |
| L | 1577 | 7.3% |
| R | 1305 | 6.1% |
| C | 1272 | 5.9% |
| V | 1266 | 5.9% |
| E | 1187 | 5.5% |
| P | 1172 | 5.5% |
| T | 1131 | 5.3% |
| H | 1090 | 5.1% |
| Q | 1068 | 5.0% |
| Other values (13) | 7731 |
Common
| Value | Count | Frequency (%) |
| 0 | 80954 | |
| 1 | 75076 | |
| 4 | 30261 | 8.1% |
| 2 | 29169 | 7.8% |
| 6 | 28519 | 7.7% |
| 3 | 28498 | 7.7% |
| 5 | 27958 | 7.5% |
| 7 | 25370 | 6.8% |
| 8 | 23851 | 6.4% |
| 9 | 22850 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 393976 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 80954 | |
| 1 | 75076 | |
| 4 | 30261 | 7.7% |
| 2 | 29169 | 7.4% |
| 6 | 28519 | 7.2% |
| 3 | 28498 | 7.2% |
| 5 | 27958 | 7.1% |
| 7 | 25370 | 6.4% |
| 8 | 23851 | 6.1% |
| 9 | 22850 | 5.8% |
| Other values (23) | 21470 | 5.4% |
EXCHANGE
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 384.9 KiB |
| NYS | 35321 |
|---|---|
| NAS | 13774 |
| OTH | 152 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 147741 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NAS |
|---|---|
| 2nd row | NAS |
| 3rd row | NAS |
| 4th row | NAS |
| 5th row | NAS |
Common Values
| Value | Count | Frequency (%) |
| NYS | 35321 | |
| NAS | 13774 | 28.0% |
| OTH | 152 | 0.3% |
Words
| Value | Count | Frequency (%) |
| nys | 35321 | |
| nas | 13774 | 28.0% |
| oth | 152 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 49095 | |
| S | 49095 | |
| Y | 35321 | |
| A | 13774 | 9.3% |
| O | 152 | 0.1% |
| T | 152 | 0.1% |
| H | 152 | 0.1% |
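With only three exchange codes, the character counts above follow directly from the value counts: each code contributes its letters once per record. The sketch below rebuilds the character table this way, which also yields the percentages left blank for N, S, and Y.

```python
from collections import Counter

# Record counts per exchange code, copied from "Common Values".
value_counts = {"NYS": 35321, "NAS": 13774, "OTH": 152}

# Each record contributes every character of its code once.
char_counts = Counter()
for code, count in value_counts.items():
    for ch in code:
        char_counts[ch] += count

total_characters = sum(char_counts.values())

for ch, count in char_counts.most_common():
    print(f"{ch}: {count} ({100 * count / total_characters:.1f}%)")
```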
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 147741 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 49095 | |
| S | 49095 | |
| Y | 35321 | |
| A | 13774 | 9.3% |
| O | 152 | 0.1% |
| T | 152 | 0.1% |
| H | 152 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 147741 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 49095 | |
| S | 49095 | |
| Y | 35321 | |
| A | 13774 | 9.3% |
| O | 152 | 0.1% |
| T | 152 | 0.1% |
| H | 152 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 147741 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 49095 | 33.2% |
| S | 49095 | 33.2% |
| Y | 35321 | 23.9% |
| A | 13774 | 9.3% |
| O | 152 | 0.1% |
| T | 152 | 0.1% |
| H | 152 | 0.1% |
ADDRESS
Categorical
| Distinct | 492 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 384.9 KiB |
| 20 South Wacker Drive | 264 |
|---|---|
| One Energy Plaza | 236 |
| One PPG Place | 186 |
| 214 North Tryon Street | 179 |
| 5 Necco Street | 170 |
| Other values (487) | 48212 |
Length
| Max length | 60 |
|---|---|
| Median length | 48 |
| Mean length | 24.314638 |
| Min length | 9 |
Characters and Unicode
| Total characters | 1197423 |
|---|---|
| Distinct characters | 68 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 Skyview Drive |
|---|---|
| 2nd row | 1 Skyview Drive |
| 3rd row | 1 Skyview Drive |
| 4th row | 1 Skyview Drive |
| 5th row | 1 Skyview Drive |
Common Values
| Value | Count | Frequency (%) |
| 20 South Wacker Drive | 264 | 0.5% |
| One Energy Plaza | 236 | 0.5% |
| One PPG Place | 186 | 0.4% |
| 214 North Tryon Street | 179 | 0.4% |
| 5 Necco Street | 170 | 0.3% |
| 420 Montgomery Street | 161 | 0.3% |
| Bank of America Corporate Center, 100 North Tryon Street | 157 | 0.3% |
| The Tower at PNC Plaza, 300 Fifth Avenue | 156 | 0.3% |
| One Coca-Cola Plaza | 154 | 0.3% |
| 800 Nicollet Mall | 153 | 0.3% |
| Other values (482) | 47431 | 96.3% |
Words
| Value | Count | Frequency (%) |
| street | 11044 | 5.4% |
| avenue | 7266 | 3.5% |
| suite | 6537 | 3.2% |
| drive | 6142 | 3.0% |
| road | 5826 | 2.8% |
| one | 4628 | 2.3% |
| boulevard | 4518 | 2.2% |
| south | 4003 | 2.0% |
| west | 3809 | 1.9% |
| north | 2910 | 1.4% |
| Other values (894) | 148384 | 72.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 155820 | 13.0% |
| e | 113033 | 9.4% |
| t | 67341 | 5.6% |
| a | 67023 | 5.6% |
| r | 65614 | 5.5% |
| 0 | 55120 | 4.6% |
| o | 53405 | 4.5% |
| n | 48417 | 4.0% |
| i | 41680 | 3.5% |
| l | 32464 | 2.7% |
| Other values (58) | 497506 | 41.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 685228 | 57.2% |
| Decimal Number | 183067 | 15.3% |
| Space Separator | 155820 | 13.0% |
| Uppercase Letter | 154846 | 12.9% |
| Other Punctuation | 17924 | 1.5% |
| Dash Punctuation | 538 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 26839 | 17.3% |
| P | 15145 | 9.8% |
| B | 11935 | 7.7% |
| A | 11189 | 7.2% |
| C | 10673 | 6.9% |
| W | 10397 | 6.7% |
| O | 8123 | 5.2% |
| D | 8096 | 5.2% |
| R | 7967 | 5.1% |
| N | 6508 | 4.2% |
| Other values (16) | 37974 | 24.5% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 113033 | 16.5% |
| t | 67341 | 9.8% |
| a | 67023 | 9.8% |
| r | 65614 | 9.6% |
| o | 53405 | 7.8% |
| n | 48417 | 7.1% |
| i | 41680 | 6.1% |
| l | 32464 | 4.7% |
| u | 32040 | 4.7% |
| s | 24356 | 3.6% |
| Other values (15) | 139855 | 20.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 55120 | 30.1% |
| 1 | 31333 | 17.1% |
| 5 | 20074 | 11.0% |
| 2 | 17849 | 9.7% |
| 3 | 13603 | 7.4% |
| 4 | 10083 | 5.5% |
| 7 | 9729 | 5.3% |
| 9 | 8625 | 4.7% |
| 6 | 8595 | 4.7% |
| 8 | 8056 | 4.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 15143 | 84.5% |
| . | 1844 | 10.3% |
| & | 373 | 2.1% |
| ' | 342 | 1.9% |
| / | 222 | 1.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 155820 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 538 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 840074 | 70.2% |
| Common | 357349 | 29.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 113033 | 13.5% |
| t | 67341 | 8.0% |
| a | 67023 | 8.0% |
| r | 65614 | 7.8% |
| o | 53405 | 6.4% |
| n | 48417 | 5.8% |
| i | 41680 | 5.0% |
| l | 32464 | 3.9% |
| u | 32040 | 3.8% |
| S | 26839 | 3.2% |
| Other values (41) | 292218 | 34.8% |
Common
| Value | Count | Frequency (%) |
| (space) | 155820 | 43.6% |
| 0 | 55120 | 15.4% |
| 1 | 31333 | 8.8% |
| 5 | 20074 | 5.6% |
| 2 | 17849 | 5.0% |
| , | 15143 | 4.2% |
| 3 | 13603 | 3.8% |
| 4 | 10083 | 2.8% |
| 7 | 9729 | 2.7% |
| 9 | 8625 | 2.4% |
| Other values (7) | 19970 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1197423 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 155820 | 13.0% |
| e | 113033 | 9.4% |
| t | 67341 | 5.6% |
| a | 67023 | 5.6% |
| r | 65614 | 5.5% |
| 0 | 55120 | 4.6% |
| o | 53405 | 4.5% |
| n | 48417 | 4.0% |
| i | 41680 | 3.5% |
| l | 32464 | 2.7% |
| Other values (58) | 497506 | 41.5% |
CITY
Categorical
| Distinct | 229 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 384.9 KiB |
| New York | 3958 |
|---|---|
| Houston | 1860 |
| Atlanta | 1722 |
| Chicago | 1705 |
| Dallas | 1103 |
| Other values (224) | 38899 |
Length
| Max length | 16 |
|---|---|
| Median length | 14 |
| Mean length | 8.4628708 |
| Min length | 4 |
Characters and Unicode
| Total characters | 416771 |
|---|---|
| Distinct characters | 49 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Fort Worth |
|---|---|
| 2nd row | Fort Worth |
| 3rd row | Fort Worth |
| 4th row | Fort Worth |
| 5th row | Fort Worth |
Common Values
| Value | Count | Frequency (%) |
| New York | 3958 | 8.0% |
| Houston | 1860 | 3.8% |
| Atlanta | 1722 | 3.5% |
| Chicago | 1705 | 3.5% |
| Dallas | 1103 | 2.2% |
| Dublin | 1037 | 2.1% |
| Charlotte | 980 | 2.0% |
| San Jose | 766 | 1.6% |
| Boston | 746 | 1.5% |
| Santa Clara | 708 | 1.4% |
| Other values (219) | 34662 | 70.4% |
Words
| Value | Count | Frequency (%) |
| new | 4292 | 6.9% |
| york | 3958 | 6.3% |
| san | 2556 | 4.1% |
| chicago | 1892 | 3.0% |
| houston | 1860 | 3.0% |
| atlanta | 1722 | 2.7% |
| dallas | 1103 | 1.8% |
| dublin | 1037 | 1.7% |
| charlotte | 980 | 1.6% |
| santa | 842 | 1.3% |
| Other values (247) | 42405 | 67.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 37403 | 9.0% |
| o | 35557 | 8.5% |
| n | 34047 | 8.2% |
| e | 32721 | 7.9% |
| i | 26102 | 6.3% |
| l | 24276 | 5.8% |
| t | 23723 | 5.7% |
| r | 21971 | 5.3% |
| s | 18307 | 4.4% |
| (space) | 13400 | 3.2% |
| Other values (39) | 149264 | 35.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 340507 | 81.7% |
| Uppercase Letter | 62864 | 15.1% |
| Space Separator | 13400 | 3.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 37403 | 11.0% |
| o | 35557 | 10.4% |
| n | 34047 | 10.0% |
| e | 32721 | 9.6% |
| i | 26102 | 7.7% |
| l | 24276 | 7.1% |
| t | 23723 | 7.0% |
| r | 21971 | 6.5% |
| s | 18307 | 5.4% |
| h | 11256 | 3.3% |
| Other values (15) | 75144 | 22.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 7577 | 12.1% |
| S | 6535 | 10.4% |
| N | 5596 | 8.9% |
| M | 4444 | 7.1% |
| D | 4302 | 6.8% |
| Y | 3958 | 6.3% |
| B | 3928 | 6.2% |
| A | 3660 | 5.8% |
| P | 2881 | 4.6% |
| H | 2838 | 4.5% |
| Other values (13) | 17145 | 27.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 13400 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 403371 | 96.8% |
| Common | 13400 | 3.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 37403 | 9.3% |
| o | 35557 | 8.8% |
| n | 34047 | 8.4% |
| e | 32721 | 8.1% |
| i | 26102 | 6.5% |
| l | 24276 | 6.0% |
| t | 23723 | 5.9% |
| r | 21971 | 5.4% |
| s | 18307 | 4.5% |
| h | 11256 | 2.8% |
| Other values (38) | 138008 | 34.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 13400 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 416771 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 37403 | 9.0% |
| o | 35557 | 8.5% |
| n | 34047 | 8.2% |
| e | 32721 | 7.9% |
| i | 26102 | 6.3% |
| l | 24276 | 5.8% |
| t | 23723 | 5.7% |
| r | 21971 | 5.3% |
| s | 18307 | 4.4% |
| (space) | 13400 | 3.2% |
| Other values (39) | 149264 | 35.8% |
STATE
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 38 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2082 |
| Missing (%) | 4.2% |
| Memory size | 384.9 KiB |
| CA | 6191 |
|---|---|
| NY | 5187 |
| TX | 4389 |
| IL | 3375 |
| OH | 2336 |
| Other values (33) | 25687 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 94330 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
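The 94330 total characters reported for STATE cover only the non-missing rows. A quick check, assuming every non-missing STATE code is exactly two characters long (as the Length table reports); the variable names are illustrative.

```python
# Figures copied from the report: dataset size and STATE's missing count.
n_observations = 49247
state_missing = 2082
state_code_length = 2  # min, median, mean and max length are all 2

# Only non-missing rows contribute characters.
state_non_missing = n_observations - state_missing
state_total_chars = state_non_missing * state_code_length

print(state_non_missing, state_total_chars)
```

This is also why the character-level percentages for STATE use 94330 as their denominator rather than twice the full row count.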
Sample
| 1st row | TX |
|---|---|
| 2nd row | TX |
| 3rd row | TX |
| 4th row | TX |
| 5th row | TX |
Common Values
| Value | Count | Frequency (%) |
| CA | 6191 | 12.6% |
| NY | 5187 | 10.5% |
| TX | 4389 | 8.9% |
| IL | 3375 | 6.9% |
| OH | 2336 | 4.7% |
| MA | 2049 | 4.2% |
| PA | 1951 | 4.0% |
| GA | 1920 | 3.9% |
| VA | 1769 | 3.6% |
| NC | 1666 | 3.4% |
| Other values (28) | 16332 | 33.2% |
| (Missing) | 2082 | 4.2% |
Words
| Value | Count | Frequency (%) |
| ca | 6191 | 13.1% |
| ny | 5187 | 11.0% |
| tx | 4389 | 9.3% |
| il | 3375 | 7.2% |
| oh | 2336 | 5.0% |
| ma | 2049 | 4.3% |
| pa | 1951 | 4.1% |
| ga | 1920 | 4.1% |
| va | 1769 | 3.8% |
| nc | 1666 | 3.5% |
| Other values (28) | 16332 | 34.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 16740 | 17.7% |
| N | 11859 | 12.6% |
| C | 10109 | 10.7% |
| I | 6553 | 6.9% |
| T | 6518 | 6.9% |
| M | 5845 | 6.2% |
| Y | 5499 | 5.8% |
| L | 5443 | 5.8% |
| X | 4389 | 4.7% |
| O | 4240 | 4.5% |
| Other values (13) | 17135 | 18.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 94330 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 16740 | 17.7% |
| N | 11859 | 12.6% |
| C | 10109 | 10.7% |
| I | 6553 | 6.9% |
| T | 6518 | 6.9% |
| M | 5845 | 6.2% |
| Y | 5499 | 5.8% |
| L | 5443 | 5.8% |
| X | 4389 | 4.7% |
| O | 4240 | 4.5% |
| Other values (13) | 17135 | 18.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 94330 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 16740 | 17.7% |
| N | 11859 | 12.6% |
| C | 10109 | 10.7% |
| I | 6553 | 6.9% |
| T | 6518 | 6.9% |
| M | 5845 | 6.2% |
| Y | 5499 | 5.8% |
| L | 5443 | 5.8% |
| X | 4389 | 4.7% |
| O | 4240 | 4.5% |
| Other values (13) | 17135 | 18.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 94330 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 16740 | 17.7% |
| N | 11859 | 12.6% |
| C | 10109 | 10.7% |
| I | 6553 | 6.9% |
| T | 6518 | 6.9% |
| M | 5845 | 6.2% |
| Y | 5499 | 5.8% |
| L | 5443 | 5.8% |
| X | 4389 | 4.7% |
| O | 4240 | 4.5% |
| Other values (13) | 17135 | 18.2% |
ZIP
Categorical
| Distinct | 384 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 113 |
| Missing (%) | 0.2% |
| Memory size | 384.9 KiB |
| 10036 | 913 |
|---|---|
| 60606 | 530 |
| 95054 | 524 |
| 77002 | 524 |
| 30328 | 473 |
| Other values (379) | 46170 |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.9741523 |
| Min length | 1 |
Characters and Unicode
| Total characters | 244400 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 76155 |
|---|---|
| 2nd row | 76155 |
| 3rd row | 76155 |
| 4th row | 76155 |
| 5th row | 76155 |
Common Values
| Value | Count | Frequency (%) |
| 10036 | 913 | 1.9% |
| 60606 | 530 | 1.1% |
| 95054 | 524 | 1.1% |
| 77002 | 524 | 1.1% |
| 30328 | 473 | 1.0% |
| 10001 | 467 | 0.9% |
| 60015 | 443 | 0.9% |
| 28202 | 435 | 0.9% |
| 75039 | 419 | 0.9% |
| 20190 | 412 | 0.8% |
| Other values (374) | 43994 | 89.3% |
Words
| Value | Count | Frequency (%) |
| 10036 | 913 | 1.8% |
| 60606 | 530 | 1.1% |
| 95054 | 524 | 1.0% |
| 77002 | 524 | 1.0% |
| 30328 | 473 | 0.9% |
| 10001 | 467 | 0.9% |
| d02 | 462 | 0.9% |
| 60015 | 443 | 0.9% |
| 28202 | 435 | 0.9% |
| 75039 | 419 | 0.8% |
| Other values (379) | 44937 | 89.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 50247 | 20.6% |
| 1 | 32535 | 13.3% |
| 2 | 32133 | 13.1% |
| 3 | 21339 | 8.7% |
| 4 | 20382 | 8.3% |
| 5 | 19324 | 7.9% |
| 6 | 17285 | 7.1% |
| 7 | 17268 | 7.1% |
| 9 | 15157 | 6.2% |
| 8 | 14994 | 6.1% |
| Other values (17) | 3736 | 1.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 240664 | 98.5% |
| Uppercase Letter | 2743 | 1.1% |
| Space Separator | 993 | 0.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 711 | 25.9% |
| M | 283 | 10.3% |
| T | 218 | 7.9% |
| H | 168 | 6.1% |
| V | 143 | 5.2% |
| Y | 126 | 4.6% |
| K | 123 | 4.5% |
| C | 115 | 4.2% |
| E | 115 | 4.2% |
| P | 114 | 4.2% |
| Other values (6) | 627 | 22.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 50247 | 20.9% |
| 1 | 32535 | 13.5% |
| 2 | 32133 | 13.4% |
| 3 | 21339 | 8.9% |
| 4 | 20382 | 8.5% |
| 5 | 19324 | 8.0% |
| 6 | 17285 | 7.2% |
| 7 | 17268 | 7.2% |
| 9 | 15157 | 6.3% |
| 8 | 14994 | 6.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 993 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 241657 | 98.9% |
| Latin | 2743 | 1.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| D | 711 | 25.9% |
| M | 283 | 10.3% |
| T | 218 | 7.9% |
| H | 168 | 6.1% |
| V | 143 | 5.2% |
| Y | 126 | 4.6% |
| K | 123 | 4.5% |
| C | 115 | 4.2% |
| E | 115 | 4.2% |
| P | 114 | 4.2% |
| Other values (6) | 627 | 22.9% |
Common
| Value | Count | Frequency (%) |
| 0 | 50247 | 20.8% |
| 1 | 32535 | 13.5% |
| 2 | 32133 | 13.3% |
| 3 | 21339 | 8.8% |
| 4 | 20382 | 8.4% |
| 5 | 19324 | 8.0% |
| 6 | 17285 | 7.2% |
| 7 | 17268 | 7.1% |
| 9 | 15157 | 6.3% |
| 8 | 14994 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 244400 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 50247 | 20.6% |
| 1 | 32535 | 13.3% |
| 2 | 32133 | 13.1% |
| 3 | 21339 | 8.7% |
| 4 | 20382 | 8.3% |
| 5 | 19324 | 7.9% |
| 6 | 17285 | 7.1% |
| 7 | 17268 | 7.1% |
| 9 | 15157 | 6.2% |
| 8 | 14994 | 6.1% |
| Other values (17) | 3736 | 1.5% |
SICDESC
Categorical
| Distinct | 178 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 384.9 KiB |
| COMMERCIAL BANKS | 2562 |
|---|---|
| REAL ESTATE INVESTMENT TRUSTS | 2422 |
| COMPUTER PROGRAMMING, DATA PROCESSING, AND OTHER C | 1741 |
| ELECTRIC AND OTHER SERVICES COMBINED | 1606 |
| SEMICONDUCTORS AND RELATED DEVICES | 1472 |
| Other values (173) | 39444 |
Length
| Max length | 50 |
|---|---|
| Median length | 44 |
| Mean length | 31.699961 |
| Min length | 8 |
Characters and Unicode
| Total characters | 1561128 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AIR TRANSPORTATION, SCHEDULED |
|---|---|
| 2nd row | AIR TRANSPORTATION, SCHEDULED |
| 3rd row | AIR TRANSPORTATION, SCHEDULED |
| 4th row | AIR TRANSPORTATION, SCHEDULED |
| 5th row | AIR TRANSPORTATION, SCHEDULED |
Common Values
| Value | Count | Frequency (%) |
| COMMERCIAL BANKS | 2562 | 5.2% |
| REAL ESTATE INVESTMENT TRUSTS | 2422 | 4.9% |
| COMPUTER PROGRAMMING, DATA PROCESSING, AND OTHER C | 1741 | 3.5% |
| ELECTRIC AND OTHER SERVICES COMBINED | 1606 | 3.3% |
| SEMICONDUCTORS AND RELATED DEVICES | 1472 | 3.0% |
| ELECTRIC SERVICES | 1376 | 2.8% |
| FIRE, MARINE, AND CASUALTY INSURANCE | 1218 | 2.5% |
| PHARMACEUTICAL PREPARATIONS | 1162 | 2.4% |
| PREPACKAGED SOFTWARE | 1053 | 2.1% |
| CRUDE PETROLEUM AND NATURAL GAS | 922 | 1.9% |
| Other values (168) | 33713 | 68.5% |
Words
| Value | Count | Frequency (%) |
| and | 22505 | 11.4% |
| services | 5717 | 2.9% |
| other | 4525 | 2.3% |
| & | 4052 | 2.1% |
| computer | 3443 | 1.7% |
| investment | 3207 | 1.6% |
| electric | 3082 | 1.6% |
| commercial | 3061 | 1.5% |
| insurance | 2576 | 1.3% |
| banks | 2562 | 1.3% |
| Other values (368) | 142837 | 72.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 167072 | 10.7% |
| (space) | 148320 | 9.5% |
| A | 132419 | 8.5% |
| R | 115745 | 7.4% |
| S | 109167 | 7.0% |
| I | 103379 | 6.6% |
| N | 102936 | 6.6% |
| T | 102786 | 6.6% |
| C | 88575 | 5.7% |
| O | 76701 | 4.9% |
| Other values (25) | 414028 | 26.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1382178 | 88.5% |
| Space Separator | 148320 | 9.5% |
| Other Punctuation | 28753 | 1.8% |
| Dash Punctuation | 1521 | 0.1% |
| Open Punctuation | 178 | < 0.1% |
| Close Punctuation | 178 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 167072 | 12.1% |
| A | 132419 | 9.6% |
| R | 115745 | 8.4% |
| S | 109167 | 7.9% |
| I | 103379 | 7.5% |
| N | 102936 | 7.4% |
| T | 102786 | 7.4% |
| C | 88575 | 6.4% |
| O | 76701 | 5.5% |
| D | 61223 | 4.4% |
| Other values (16) | 322175 | 23.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 23958 | 83.3% |
| & | 4052 | 14.1% |
| ' | 437 | 1.5% |
| ; | 233 | 0.8% |
| : | 73 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 148320 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1521 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 178 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 178 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1382178 | 88.5% |
| Common | 178950 | 11.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 167072 | 12.1% |
| A | 132419 | 9.6% |
| R | 115745 | 8.4% |
| S | 109167 | 7.9% |
| I | 103379 | 7.5% |
| N | 102936 | 7.4% |
| T | 102786 | 7.4% |
| C | 88575 | 6.4% |
| O | 76701 | 5.5% |
| D | 61223 | 4.4% |
| Other values (16) | 322175 | 23.3% |
Common
| Value | Count | Frequency (%) |
| (space) | 148320 | 82.9% |
| , | 23958 | 13.4% |
| & | 4052 | 2.3% |
| - | 1521 | 0.8% |
| ' | 437 | 0.2% |
| ; | 233 | 0.1% |
| ( | 178 | 0.1% |
| ) | 178 | 0.1% |
| : | 73 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1561128 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 167072 | 10.7% |
| (space) | 148320 | 9.5% |
| A | 132419 | 8.5% |
| R | 115745 | 7.4% |
| S | 109167 | 7.0% |
| I | 103379 | 6.6% |
| N | 102936 | 6.6% |
| T | 102786 | 6.6% |
| C | 88575 | 5.7% |
| O | 76701 | 4.9% |
| Other values (25) | 414028 | 26.5% |
NAICSDESC
Categorical
| Distinct | 203 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 384.9 KiB |
| Commercial Banking | 2596 |
|---|---|
| Electric Power Generation | 1629 |
| Semiconductor and Related Device Manufacturing | 1472 |
| Electric Power Generation, Transmission and Distri | 1466 |
| Lessors of Nonresidential Buildings (except Miniwa | 1333 |
| Other values (198) | 40751 |
Length
| Max length | 50 |
|---|---|
| Median length | 46 |
| Mean length | 38.158568 |
| Min length | 9 |
Characters and Unicode
| Total characters | 1879195 |
|---|---|
| Distinct characters | 66 |
| Distinct categories | 9 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Scheduled Passenger Air Transportation |
|---|---|
| 2nd row | Scheduled Passenger Air Transportation |
| 3rd row | Scheduled Passenger Air Transportation |
| 4th row | Scheduled Passenger Air Transportation |
| 5th row | Scheduled Passenger Air Transportation |
Common Values
| Value | Count | Frequency (%) |
| Commercial Banking | 2596 | 5.3% |
| Electric Power Generation | 1629 | 3.3% |
| Semiconductor and Related Device Manufacturing | 1472 | 3.0% |
| Electric Power Generation, Transmission and Distri | 1466 | 3.0% |
| Lessors of Nonresidential Buildings (except Miniwa | 1333 | 2.7% |
| Direct Property and Casualty Insurance Carriers | 1269 | 2.6% |
| Pharmaceutical Preparation Manufacturing | 1162 | 2.4% |
| Oil and Gas Extraction | 928 | 1.9% |
| Data Processing, Hosting, and Related Services (ef | 863 | 1.8% |
| Lessors of Residential Buildings and Dwellings | 863 | 1.8% |
| Other values (193) | 35666 | 72.4% |
Words
| Value | Count | Frequency (%) |
| and | 25531 | 11.1% |
| manufacturing | 12645 | 5.5% |
| other | 4591 | 2.0% |
| power | 3207 | 1.4% |
| electric | 3095 | 1.3% |
| generation | 3095 | 1.3% |
| eff | 3040 | 1.3% |
| banking | 3020 | 1.3% |
| insurance | 2998 | 1.3% |
| related | 2903 | 1.3% |
| Other values (434) | 166069 | 72.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 181307 | 9.6% |
| e | 163260 | 8.7% |
| a | 151970 | 8.1% |
| n | 143177 | 7.6% |
| r | 136309 | 7.3% |
| i | 133443 | 7.1% |
| t | 112006 | 6.0% |
| s | 81020 | 4.3% |
| o | 78884 | 4.2% |
| c | 74965 | 4.0% |
| Other values (56) | 622854 | 33.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1461080 | 77.8% |
| Uppercase Letter | 187665 | 10.0% |
| Space Separator | 181307 | 9.6% |
| Decimal Number | 17663 | 0.9% |
| Other Punctuation | 15144 | 0.8% |
| Open Punctuation | 9538 | 0.5% |
| Close Punctuation | 2546 | 0.1% |
| Dash Punctuation | 2351 | 0.1% |
| Connector Punctuation | 1901 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 163260 | 11.2% |
| a | 151970 | 10.4% |
| n | 143177 | 9.8% |
| r | 136309 | 9.3% |
| i | 133443 | 9.1% |
| t | 112006 | 7.7% |
| s | 81020 | 5.5% |
| o | 78884 | 5.4% |
| c | 74965 | 5.1% |
| u | 67632 | 4.6% |
| Other values (16) | 318414 | 21.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 24961 | 13.3% |
| P | 19843 | 10.6% |
| S | 19550 | 10.4% |
| C | 18171 | 9.7% |
| D | 14041 | 7.5% |
| A | 10851 | 5.8% |
| E | 10215 | 5.4% |
| B | 9624 | 5.1% |
| I | 8494 | 4.5% |
| G | 8047 | 4.3% |
| Other values (14) | 43868 | 23.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 5545 | 31.4% |
| 0 | 3317 | 18.8% |
| 6 | 3046 | 17.2% |
| 1 | 2621 | 14.8% |
| 5 | 1377 | 7.8% |
| 4 | 1052 | 6.0% |
| 3 | 513 | 2.9% |
| 8 | 192 | 1.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 9198 | 60.7% |
| / | 5561 | 36.7% |
| ' | 385 | 2.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 181307 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 9538 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2546 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2351 | 100.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1901 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1648745 | 87.7% |
| Common | 230450 | 12.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 163260 | 9.9% |
| a | 151970 | 9.2% |
| n | 143177 | 8.7% |
| r | 136309 | 8.3% |
| i | 133443 | 8.1% |
| t | 112006 | 6.8% |
| s | 81020 | 4.9% |
| o | 78884 | 4.8% |
| c | 74965 | 4.5% |
| u | 67632 | 4.1% |
| Other values (40) | 506079 | 30.7% |
Common
| Value | Count | Frequency (%) |
| (space) | 181307 | 78.7% |
| ( | 9538 | 4.1% |
| , | 9198 | 4.0% |
| / | 5561 | 2.4% |
| 2 | 5545 | 2.4% |
| 0 | 3317 | 1.4% |
| 6 | 3046 | 1.3% |
| 1 | 2621 | 1.1% |
| ) | 2546 | 1.1% |
| - | 2351 | 1.0% |
| Other values (6) | 5420 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1879195 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 181307 | 9.6% |
| e | 163260 | 8.7% |
| a | 151970 | 8.1% |
| n | 143177 | 7.6% |
| r | 136309 | 7.3% |
| i | 133443 | 7.1% |
| t | 112006 | 6.0% |
| s | 81020 | 4.3% |
| o | 78884 | 4.2% |
| c | 74965 | 4.0% |
| Other values (56) | 622854 | 33.1% |
INDDESC
Categorical
| Distinct | 122 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 384.9 KiB |
| Electric Utilities | 1970 |
|---|---|
| Regional Banks | 1602 |
| Health Care Equipment | 1458 |
| Semiconductors | 1366 |
| Packaged Foods & Meats | 1259 |
| Other values (117) | 41592 |
Length
| Max length | 44 |
|---|---|
| Median length | 32 |
| Mean length | 21.542124 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1060885 |
|---|---|
| Distinct characters | 48 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Airlines |
|---|---|
| 2nd row | Airlines |
| 3rd row | Airlines |
| 4th row | Airlines |
| 5th row | Airlines |
Common Values
| Value | Count | Frequency (%) |
| Electric Utilities | 1970 | 4.0% |
| Regional Banks | 1602 | 3.3% |
| Health Care Equipment | 1458 | 3.0% |
| Semiconductors | 1366 | 2.8% |
| Packaged Foods & Meats | 1259 | 2.6% |
| Financial Exchanges & Data | 1134 | 2.3% |
| Multi-Utilities | 1122 | 2.3% |
| Aerospace & Defense | 1113 | 2.3% |
| Life Sciences Tools & Services | 1023 | 2.1% |
| Industrial Machinery | 993 | 2.0% |
| Other values (112) | 36207 | 73.5% |
Words
| Value | Count | Frequency (%) |
| & | 20722 | 14.7% |
| services | 5186 | 3.7% |
| health | 4233 | 3.0% |
| equipment | 3592 | 2.6% |
| care | 3547 | 2.5% |
| banks | 3250 | 2.3% |
| reits | 2520 | 1.8% |
| insurance | 2510 | 1.8% |
| gas | 2298 | 1.6% |
| oil | 2187 | 1.6% |
| Other values (175) | 90592 | 64.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 105034 | 9.9% |
| (space) | 91390 | 8.6% |
| i | 80659 | 7.6% |
| t | 71524 | 6.7% |
| a | 71307 | 6.7% |
| s | 69041 | 6.5% |
| r | 61659 | 5.8% |
| n | 59784 | 5.6% |
| o | 49069 | 4.6% |
| c | 46157 | 4.4% |
| Other values (38) | 355261 | 33.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 816369 | 77.0% |
| Uppercase Letter | 129289 | 12.2% |
| Space Separator | 91390 | 8.6% |
| Other Punctuation | 22259 | 2.1% |
| Dash Punctuation | 1578 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 105034 | 12.9% |
| i | 80659 | 9.9% |
| t | 71524 | 8.8% |
| a | 71307 | 8.7% |
| s | 69041 | 8.5% |
| r | 61659 | 7.6% |
| n | 59784 | 7.3% |
| o | 49069 | 6.0% |
| c | 46157 | 5.7% |
| l | 43205 | 5.3% |
| Other values (15) | 158930 | 19.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 15103 | 11.7% |
| C | 12851 | 9.9% |
| E | 12680 | 9.8% |
| I | 10110 | 7.8% |
| R | 9329 | 7.2% |
| P | 8650 | 6.7% |
| H | 8074 | 6.2% |
| M | 7984 | 6.2% |
| T | 6665 | 5.2% |
| A | 6601 | 5.1% |
| Other values (9) | 31242 | 24.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 20722 | 93.1% |
| , | 1537 | 6.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 91390 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1578 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 945658 | 89.1% |
| Common | 115227 | 10.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 105034 | 11.1% |
| i | 80659 | 8.5% |
| t | 71524 | 7.6% |
| a | 71307 | 7.5% |
| s | 69041 | 7.3% |
| r | 61659 | 6.5% |
| n | 59784 | 6.3% |
| o | 49069 | 5.2% |
| c | 46157 | 4.9% |
| l | 43205 | 4.6% |
| Other values (34) | 288219 | 30.5% |
Common
| Value | Count | Frequency (%) |
| (space) | 91390 | 79.3% |
| & | 20722 | 18.0% |
| - | 1578 | 1.4% |
| , | 1537 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1060885 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 105034 | 9.9% |
| (space) | 91390 | 8.6% |
| i | 80659 | 7.6% |
| t | 71524 | 6.7% |
| a | 71307 | 6.7% |
| s | 69041 | 6.5% |
| r | 61659 | 5.8% |
| n | 59784 | 5.6% |
| o | 49069 | 4.6% |
| c | 46157 | 4.4% |
| Other values (38) | 355261 | 33.5% |
SPCODE
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 384.9 KiB |
| SP | 49069 |
|---|---|
| EX | 178 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 98494 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SP |
|---|---|
| 2nd row | SP |
| 3rd row | SP |
| 4th row | SP |
| 5th row | SP |
Common Values
| Value | Count | Frequency (%) |
| SP | 49069 | 99.6% |
| EX | 178 | 0.4% |
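The report's alerts flag SPCODE as highly imbalanced (96.5%). That figure is consistent with a score of one minus the normalized Shannon entropy of the category shares. The sketch below reproduces it with the standard library only; treating this as the profiler's exact formula is an assumption, and the function name `imbalance_score` is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H/log2(k): 0 for a uniform split, approaching 1 as one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    # Shannon entropy in bits; zero-count classes contribute nothing.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# SP and EX counts from the "Common Values" table above.
spcode_score = imbalance_score([49069, 178])
print(f"{spcode_score:.1%}")
```

A perfectly balanced two-class column would score 0 under this definition, so a value this close to 1 indicates that SPCODE carries very little information.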
Words
| Value | Count | Frequency (%) |
| sp | 49069 | 99.6% |
| ex | 178 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 49069 | 49.8% |
| P | 49069 | 49.8% |
| E | 178 | 0.2% |
| X | 178 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 98494 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 49069 | 49.8% |
| P | 49069 | 49.8% |
| E | 178 | 0.2% |
| X | 178 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 98494 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 49069 | 49.8% |
| P | 49069 | 49.8% |
| E | 178 | 0.2% |
| X | 178 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 98494 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 49069 | 49.8% |
| P | 49069 | 49.8% |
| E | 178 | 0.2% |
| X | 178 | 0.2% |
TICKER
Categorical
| Distinct | 495 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 384.9 KiB |
| CME | 264 |
|---|---|
| TFC | 179 |
| GE | 170 |
| WFC | 161 |
| BAC | 157 |
| Other values (490) | 48316 |
Length
| Max length | 5 |
|---|---|
| Median length | 3 |
| Mean length | 3.1061994 |
| Min length | 1 |
Characters and Unicode
| Total characters | 152971 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AAL |
|---|---|
| 2nd row | AAL |
| 3rd row | AAL |
| 4th row | AAL |
| 5th row | AAL |
Common Values
| Value | Count | Frequency (%) |
| CME | 264 | 0.5% |
| TFC | 179 | 0.4% |
| GE | 170 | 0.3% |
| WFC | 161 | 0.3% |
| BAC | 157 | 0.3% |
| PNC | 156 | 0.3% |
| KO | 154 | 0.3% |
| C | 153 | 0.3% |
| USB | 153 | 0.3% |
| CBOE | 152 | 0.3% |
| Other values (485) | 47548 | 96.6% |
Words
| Value | Count | Frequency (%) |
| cme | 264 | 0.5% |
| tfc | 179 | 0.4% |
| ge | 170 | 0.3% |
| wfc | 161 | 0.3% |
| bac | 157 | 0.3% |
| pnc | 156 | 0.3% |
| ko | 154 | 0.3% |
| c | 153 | 0.3% |
| usb | 153 | 0.3% |
| cboe | 152 | 0.3% |
| Other values (485) | 47548 | 96.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 11320 | 7.4% |
| A | 11113 | 7.3% |
| M | 9543 | 6.2% |
| T | 9354 | 6.1% |
| S | 8993 | 5.9% |
| L | 8310 | 5.4% |
| E | 8213 | 5.4% |
| R | 8019 | 5.2% |
| P | 7764 | 5.1% |
| N | 7352 | 4.8% |
| Other values (17) | 62990 | 41.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 152764 | 99.9% |
| Other Punctuation | 207 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 11320 | 7.4% |
| A | 11113 | 7.3% |
| M | 9543 | 6.2% |
| T | 9354 | 6.1% |
| S | 8993 | 5.9% |
| L | 8310 | 5.4% |
| E | 8213 | 5.4% |
| R | 8019 | 5.2% |
| P | 7764 | 5.1% |
| N | 7352 | 4.8% |
| Other values (16) | 62783 | 41.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 207 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 152764 | 99.9% |
| Common | 207 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 11320 | 7.4% |
| A | 11113 | 7.3% |
| M | 9543 | 6.2% |
| T | 9354 | 6.1% |
| S | 8993 | 5.9% |
| L | 8310 | 5.4% |
| E | 8213 | 5.4% |
| R | 8019 | 5.2% |
| P | 7764 | 5.1% |
| N | 7352 | 4.8% |
| Other values (16) | 62783 | 41.1% |
Common
| Value | Count | Frequency (%) |
| . | 207 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 152971 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 11320 | 7.4% |
| A | 11113 | 7.3% |
| M | 9543 | 6.2% |
| T | 9354 | 6.1% |
| S | 8993 | 5.9% |
| L | 8310 | 5.4% |
| E | 8213 | 5.4% |
| R | 8019 | 5.2% |
| P | 7764 | 5.1% |
| N | 7352 | 4.8% |
| Other values (17) | 62990 | 41.2% |
SUB_TELE
Real number (ℝ)
| Distinct | 150 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 168 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 536.79087 |
| Minimum | 31 |
|---|---|
| Maximum | 989 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 384.9 KiB |
Quantile statistics
| Minimum | 31 |
|---|---|
| 5-th percentile | 206 |
| Q1 | 312 |
| median | 513 |
| Q3 | 737 |
| 95-th percentile | 949 |
| Maximum | 989 |
| Range | 958 |
| Interquartile range (IQR) | 425 |
Descriptive statistics
| Standard deviation | 251.99508 |
|---|---|
| Coefficient of variation (CV) | 0.4694474 |
| Kurtosis | -1.217201 |
| Mean | 536.79087 |
| Median Absolute Deviation (MAD) | 207 |
| Skewness | 0.074568053 |
| Sum | 26345159 |
| Variance | 63501.52 |
| Monotonicity | Not monotonic |
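The derived rows in the SUB_TELE tables can be reproduced from the primary ones. The following is a minimal Python sketch, not part of the generated report. It assumes the usual definitions (range = maximum minus minimum, IQR = Q3 minus Q1, CV = standard deviation / mean, variance = standard deviation squared, sum = mean × non-missing count) and uses values copied from the tables above; the dictionary names are illustrative.

```python
import math

# Values copied from the SUB_TELE tables above.
n_total, n_missing = 49247, 168      # observations (report overview), missing
mean, std = 536.79087, 251.99508
minimum, maximum = 31, 989
q1, q3 = 312, 737

# Derived statistics as printed in the report.
reported = {
    "range": 958,
    "iqr": 425,
    "cv": 0.4694474,
    "variance": 63501.52,
    "sum": 26345159,
}

# The same statistics recomputed under the assumed definitions.
derived = {
    "range": maximum - minimum,
    "iqr": q3 - q1,
    "cv": std / mean,
    "variance": std ** 2,
    "sum": mean * (n_total - n_missing),
}

for name, value in derived.items():
    # rel_tol absorbs the rounding of the printed inputs.
    agrees = math.isclose(value, reported[name], rel_tol=1e-6)
    print(f"{name}: derived={value:.7g} reported={reported[name]} agrees={agrees}")
```

The sum is computed over the non-missing count (49247 minus 168), since missing values do not contribute to the mean.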
| Value | Count | Frequency (%) |
| 212 | 3687 | 7.5% |
| 408 | 1877 | 3.8% |
| 713 | 1386 | 2.8% |
| 650 | 1177 | 2.4% |
| 847 | 1132 | 2.3% |
| 972 | 1078 | 2.2% |
| 800 | 1055 | 2.1% |
| 353 | 1052 | 2.1% |
| 703 | 1021 | 2.1% |
| 312 | 972 | 2.0% |
| Other values (140) | 34642 |
| Value | Count | Frequency (%) |
| 31 | 116 | 0.2% |
| 41 | 377 | 0.8% |
| 44 | 321 | 0.7% |
| 201 | 336 | 0.7% |
| 202 | 244 | 0.5% |
| 203 | 902 | 1.8% |
| 205 | 110 | 0.2% |
| 206 | 675 | 1.4% |
| 207 | 82 | 0.2% |
| 208 | 101 | 0.2% |
| Value | Count | Frequency (%) |
| 989 | 10 | < 0.1% |
| 985 | 74 | 0.2% |
| 980 | 202 | 0.4% |
| 978 | 75 | 0.2% |
| 973 | 486 | 1.0% |
| 972 | 1078 | 2.2% |
| 952 | 193 | 0.4% |
| 951 | 63 | 0.1% |
| 949 | 310 | 0.6% |
| 941 | 78 | 0.2% |
NAICS
Real number (ℝ)
| Distinct | 205 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 382695.93 |
| Minimum | 42 |
|---|---|
| Maximum | 999977 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 384.9 KiB |
Quantile statistics
| Minimum | 42 |
|---|---|
| 5-th percentile | 2211 |
| Q1 | 325412 |
| median | 339113 |
| Q3 | 522210 |
| 95-th percentile | 561450 |
| Maximum | 999977 |
| Range | 999935 |
| Interquartile range (IQR) | 196798 |
Descriptive statistics
| Standard deviation | 182331.85 |
|---|---|
| Coefficient of variation (CV) | 0.47644053 |
| Kurtosis | 0.47772293 |
| Mean | 382695.93 |
| Median Absolute Deviation (MAD) | 174097 |
| Skewness | -0.5272223 |
| Sum | 1.8846627 × 10¹⁰ |
| Variance | 3.3244904 × 10¹⁰ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 522110 | 2596 | 5.3% |
| 22111 | 1629 | 3.3% |
| 334413 | 1472 | 3.0% |
| 2211 | 1466 | 3.0% |
| 531120 | 1333 | 2.7% |
| 524126 | 1269 | 2.6% |
| 325412 | 1162 | 2.4% |
| 2111 | 928 | 1.9% |
| 531110 | 863 | 1.8% |
| 518210 | 863 | 1.8% |
| Other values (195) | 35666 | 72.4% |
| Value | Count | Frequency (%) |
| 42 | 110 | 0.2% |
| 111 | 11 | < 0.1% |
| 315 | 209 | 0.4% |
| 321 | 98 | 0.2% |
| 325 | 114 | 0.2% |
| 423 | 106 | 0.2% |
| 621 | 109 | 0.2% |
| 2111 | 928 | 1.9% |
| 2211 | 1466 | 3.0% |
| 3113 | 104 | 0.2% |
| Value | Count | Frequency (%) |
| 999977 | 274 | 0.6% |
| 812331 | 67 | 0.1% |
| 722513 | 418 | 0.8% |
| 722511 | 185 | 0.4% |
| 721120 | 281 | 0.6% |
| 721110 | 167 | 0.3% |
| 713210 | 36 | 0.1% |
| 711320 | 132 | 0.3% |
| 622110 | 158 | 0.3% |
| 621511 | 188 | 0.4% |
SPINDEX
Real number (ℝ)
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3484.1592 |
| Minimum | 1010 |
|---|---|
| Maximum | 6010 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 384.9 KiB |
Quantile statistics
| Minimum | 1010 |
|---|---|
| 5-th percentile | 1510 |
| Q1 | 2510 |
| median | 3520 |
| Q3 | 4510 |
| 95-th percentile | 6010 |
| Maximum | 6010 |
| Range | 5000 |
| Interquartile range (IQR) | 2000 |
Descriptive statistics
| Standard deviation | 1349.3765 |
|---|---|
| Coefficient of variation (CV) | 0.38728898 |
| Kurtosis | -0.88633474 |
| Mean | 3484.1592 |
| Median Absolute Deviation (MAD) | 1000 |
| Skewness | 0.039699725 |
| Sum | 1.7158439 × 10⁸ |
| Variance | 1820816.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2010 | 4390 | 8.9% |
| 5510 | 3395 | 6.9% |
| 3510 | 3260 | 6.6% |
| 4510 | 3070 | 6.2% |
| 4020 | 3005 | 6.1% |
| 1510 | 2847 | 5.8% |
| 6010 | 2628 | 5.3% |
| 4030 | 2576 | 5.2% |
| 3520 | 2569 | 5.2% |
| 3020 | 2346 | 4.8% |
| Other values (14) | 19161 | 38.9% |
| Value | Count | Frequency (%) |
| 1010 | 2187 | 4.4% |
| 1510 | 2847 | 5.8% |
| 2010 | 4390 | 8.9% |
| 2020 | 910 | 1.8% |
| 2030 | 1506 | 3.1% |
| 2510 | 542 | 1.1% |
| 2520 | 1261 | 2.6% |
| 2530 | 1597 | 3.2% |
| 2550 | 2049 | 4.2% |
| 3010 | 589 | 1.2% |
| Value | Count | Frequency (%) |
| 6010 | 2628 | 5.3% |
| 5510 | 3395 | 6.9% |
| 5020 | 1611 | 3.3% |
| 5010 | 429 | 0.9% |
| 4530 | 1854 | 3.8% |
| 4520 | 1652 | 3.4% |
| 4510 | 3070 | 6.2% |
| 4030 | 2576 | 5.2% |
| 4020 | 3005 | 6.1% |
| 4010 | 2336 | 4.7% |
SIC
Real number (ℝ)
| Distinct | 178 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4822.9585 |
| Minimum | 100 |
|---|---|
| Maximum | 9997 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 384.9 KiB |
Quantile statistics
| Minimum | 100 |
|---|---|
| 5-th percentile | 2030 |
| Q1 | 3572 |
| median | 4911 |
| Q3 | 6282 |
| 95-th percentile | 7373 |
| Maximum | 9997 |
| Range | 9897 |
| Interquartile range (IQR) | 2710 |
Descriptive statistics
| Standard deviation | 1839.929 |
|---|---|
| Coefficient of variation (CV) | 0.38149384 |
| Kurtosis | -0.67119573 |
| Mean | 4822.9585 |
| Median Absolute Deviation (MAD) | 1351 |
| Skewness | 0.10163571 |
| Sum | 2.3751624 × 10⁸ |
| Variance | 3385338.6 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6020 | 2562 | 5.2% |
| 6798 | 2422 | 4.9% |
| 7370 | 1741 | 3.5% |
| 4931 | 1606 | 3.3% |
| 3674 | 1472 | 3.0% |
| 4911 | 1376 | 2.8% |
| 6331 | 1218 | 2.5% |
| 2834 | 1162 | 2.4% |
| 7372 | 1053 | 2.1% |
| 1311 | 922 | 1.9% |
| Other values (168) | 33713 | 68.5% |
| Value | Count | Frequency (%) |
| 100 | 11 | < 0.1% |
| 1000 | 104 | 0.2% |
| 1040 | 111 | 0.2% |
| 1311 | 922 | 1.9% |
| 1389 | 254 | 0.5% |
| 1400 | 204 | 0.4% |
| 1531 | 360 | 0.7% |
| 1731 | 94 | 0.2% |
| 2000 | 111 | 0.2% |
| 2011 | 217 | 0.4% |
| Value | Count | Frequency (%) |
| 9997 | 386 | 0.8% |
| 8742 | 114 | 0.2% |
| 8731 | 152 | 0.3% |
| 8721 | 87 | 0.2% |
| 8700 | 89 | 0.2% |
| 8090 | 94 | 0.2% |
| 8071 | 188 | 0.4% |
| 8062 | 158 | 0.3% |
| 8000 | 109 | 0.2% |
| 7990 | 317 | 0.6% |
| | GVKEY | DIRNBR | CASH_FEES | STOCK_AWARDS | OPTION_AWARDS | NONEQ_INCENT | PENSION_CHG | OTHCOMP | TOTAL_SEC | YEAR | SUB_TELE | NAICS | SPINDEX | SIC | EXCHANGE | STATE | SPCODE |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GVKEY | 1.000 | -0.089 | -0.148 | 0.005 | 0.060 | -0.057 | -0.099 | -0.203 | -0.021 | 0.027 | -0.020 | 0.274 | 0.127 | 0.231 | 0.174 | 0.241 | 0.060 |
| DIRNBR | -0.089 | 1.000 | 0.049 | 0.015 | -0.069 | -0.004 | 0.017 | 0.060 | 0.011 | 0.028 | -0.022 | -0.020 | -0.008 | -0.012 | 0.067 | 0.061 | 0.000 |
| CASH_FEES | -0.148 | 0.049 | 1.000 | 0.190 | -0.112 | -0.027 | 0.061 | 0.197 | 0.523 | 0.171 | -0.004 | -0.089 | -0.083 | -0.104 | 0.007 | 0.019 | 0.000 |
| STOCK_AWARDS | 0.005 | 0.015 | 0.190 | 1.000 | -0.302 | -0.006 | -0.016 | 0.056 | 0.687 | 0.286 | 0.015 | -0.008 | 0.027 | -0.002 | 0.003 | 0.000 | 0.000 |
| OPTION_AWARDS | 0.060 | -0.069 | -0.112 | -0.302 | 1.000 | 0.001 | -0.011 | -0.107 | 0.066 | -0.143 | 0.022 | 0.019 | -0.003 | -0.011 | 0.024 | 0.000 | 0.000 |
| NONEQ_INCENT | -0.057 | -0.004 | -0.027 | -0.006 | 0.001 | 1.000 | 0.067 | 0.042 | 0.008 | -0.017 | -0.045 | -0.035 | -0.064 | -0.055 | 0.004 | 0.000 | 0.000 |
| PENSION_CHG | -0.099 | 0.017 | 0.061 | -0.016 | -0.011 | 0.067 | 1.000 | 0.076 | 0.052 | -0.045 | -0.003 | -0.083 | 0.031 | -0.032 | 0.020 | 0.036 | 0.000 |
| OTHCOMP | -0.203 | 0.060 | 0.197 | 0.056 | -0.107 | 0.042 | 0.076 | 1.000 | 0.227 | -0.020 | 0.031 | -0.113 | -0.147 | -0.137 | 0.000 | 0.000 | 0.000 |
| TOTAL_SEC | -0.021 | 0.011 | 0.523 | 0.687 | 0.066 | 0.008 | 0.052 | 0.227 | 1.000 | 0.280 | 0.040 | -0.031 | -0.017 | -0.039 | 0.003 | 0.000 | 0.000 |
| YEAR | 0.027 | 0.028 | 0.171 | 0.286 | -0.143 | -0.017 | -0.045 | -0.020 | 0.280 | 1.000 | 0.002 | 0.004 | 0.001 | 0.005 | 0.000 | 0.000 | 0.000 |
| SUB_TELE | -0.020 | -0.022 | -0.004 | 0.015 | 0.022 | -0.045 | -0.003 | 0.031 | 0.040 | 0.002 | 1.000 | -0.091 | -0.001 | -0.004 | 0.152 | 0.542 | 0.108 |
| NAICS | 0.274 | -0.020 | -0.089 | -0.008 | 0.019 | -0.035 | -0.083 | -0.113 | -0.031 | 0.004 | -0.091 | 1.000 | 0.267 | 0.775 | 0.143 | 0.382 | 0.082 |
| SPINDEX | 0.127 | -0.008 | -0.083 | 0.027 | -0.003 | -0.064 | 0.031 | -0.147 | -0.017 | 0.001 | -0.001 | 0.267 | 1.000 | 0.473 | 0.267 | 0.356 | 0.181 |
| SIC | 0.231 | -0.012 | -0.104 | -0.002 | -0.011 | -0.055 | -0.032 | -0.137 | -0.039 | 0.005 | -0.004 | 0.775 | 0.473 | 1.000 | 0.189 | 0.339 | 0.165 |
| EXCHANGE | 0.174 | 0.067 | 0.007 | 0.003 | 0.024 | 0.004 | 0.020 | 0.000 | 0.003 | 0.000 | 0.152 | 0.143 | 0.267 | 0.189 | 1.000 | 0.317 | 0.096 |
| STATE | 0.241 | 0.061 | 0.019 | 0.000 | 0.000 | 0.000 | 0.036 | 0.000 | 0.000 | 0.000 | 0.542 | 0.382 | 0.356 | 0.339 | 0.317 | 1.000 | 0.109 |
| SPCODE | 0.060 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.108 | 0.082 | 0.181 | 0.165 | 0.096 | 0.109 | 1.000 |
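To see which pairs in the matrix above stand out, the sketch below filters a hand-copied subset of its entries by absolute value. The 0.5 cutoff is an assumption chosen for illustration, not a setting confirmed by this report, and `high_pairs` is a hypothetical helper.

```python
# Selected upper-triangle entries copied from the correlation matrix above.
pairs = {
    ("CASH_FEES", "STOCK_AWARDS"): 0.190,
    ("CASH_FEES", "TOTAL_SEC"): 0.523,
    ("STOCK_AWARDS", "OPTION_AWARDS"): -0.302,
    ("STOCK_AWARDS", "TOTAL_SEC"): 0.687,
    ("SUB_TELE", "STATE"): 0.542,
    ("NAICS", "SIC"): 0.775,
    ("NAICS", "STATE"): 0.382,
    ("SPINDEX", "SIC"): 0.473,
}

THRESHOLD = 0.5  # assumed cutoff, for illustration only


def high_pairs(pairs, threshold):
    """Return (a, b, r) tuples with |r| >= threshold, strongest first."""
    flagged = [(a, b, r) for (a, b), r in pairs.items() if abs(r) >= threshold]
    return sorted(flagged, key=lambda item: -abs(item[2]))


for a, b, r in high_pairs(pairs, THRESHOLD):
    print(f"{a} ~ {b}: {r:.3f}")
```

With this cutoff the flagged pairs are NAICS/SIC, STOCK_AWARDS/TOTAL_SEC, SUB_TELE/STATE and CASH_FEES/TOTAL_SEC, which agrees with the high-correlation alerts in the report overview. No off-diagonal entry omitted from the subset reaches 0.5 in absolute value.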
| | GVKEY | DIRNBR | DIRNAME | CASH_FEES | STOCK_AWARDS | OPTION_AWARDS | NONEQ_INCENT | PENSION_CHG | OTHCOMP | TOTAL_SEC | YEAR | CONAME | CUSIP | EXCHANGE | ADDRESS | CITY | STATE | ZIP | SICDESC | NAICSDESC | INDDESC | SPCODE | TICKER | SUB_TELE | NAICS | SPINDEX | SIC |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1045 | 1 | Roger T. Staubach | 37.0 | 24.103 | 0.0 | 0.0 | 0.000 | 8.349 | 69.452 | 2010 | AMERICAN AIRLINES GROUP INC | 02376R10 | NAS | 1 Skyview Drive | Fort Worth | TX | 76155 | AIR TRANSPORTATION, SCHEDULED | Scheduled Passenger Air Transportation | Airlines | SP | AAL | 682.0 | 481111 | 2030 | 4512 |
| 1 | 1045 | 2 | Ann McLaughlin Korologos | 39.0 | 18.949 | 0.0 | 0.0 | 14.485 | 8.521 | 80.955 | 2010 | AMERICAN AIRLINES GROUP INC | 02376R10 | NAS | 1 Skyview Drive | Fort Worth | TX | 76155 | AIR TRANSPORTATION, SCHEDULED | Scheduled Passenger Air Transportation | Airlines | SP | AAL | 682.0 | 481111 | 2030 | 4512 |
| 2 | 1045 | 3 | Judith Rodin, Ph.D. | 37.0 | 24.103 | 0.0 | 0.0 | 0.000 | 15.078 | 76.181 | 2010 | AMERICAN AIRLINES GROUP INC | 02376R10 | NAS | 1 Skyview Drive | Fort Worth | TX | 76155 | AIR TRANSPORTATION, SCHEDULED | Scheduled Passenger Air Transportation | Airlines | SP | AAL | 682.0 | 481111 | 2030 | 4512 |
| 3 | 1045 | 4 | David L. Boren | 39.0 | 18.949 | 0.0 | 0.0 | 16.300 | 3.078 | 77.327 | 2010 | AMERICAN AIRLINES GROUP INC | 02376R10 | NAS | 1 Skyview Drive | Fort Worth | TX | 76155 | AIR TRANSPORTATION, SCHEDULED | Scheduled Passenger Air Transportation | Airlines | SP | AAL | 682.0 | 481111 | 2030 | 4512 |
| 4 | 1045 | 5 | Ray M. Robinson, Jr. | 39.0 | 24.103 | 0.0 | 0.0 | 0.000 | 9.630 | 72.733 | 2010 | AMERICAN AIRLINES GROUP INC | 02376R10 | NAS | 1 Skyview Drive | Fort Worth | TX | 76155 | AIR TRANSPORTATION, SCHEDULED | Scheduled Passenger Air Transportation | Airlines | SP | AAL | 682.0 | 481111 | 2030 | 4512 |
| 5 | 1045 | 6 | Armando M. Codina | 37.0 | 18.949 | 0.0 | 0.0 | 13.338 | 5.433 | 74.720 | 2010 | AMERICAN AIRLINES GROUP INC | 02376R10 | NAS | 1 Skyview Drive | Fort Worth | TX | 76155 | AIR TRANSPORTATION, SCHEDULED | Scheduled Passenger Air Transportation | Airlines | SP | AAL | 682.0 | 481111 | 2030 | 4512 |
| 6 | 1045 | 7 | Michael A. Miles | 39.0 | 24.103 | 0.0 | 0.0 | 0.000 | 5.223 | 68.326 | 2010 | AMERICAN AIRLINES GROUP INC | 02376R10 | NAS | 1 Skyview Drive | Fort Worth | TX | 76155 | AIR TRANSPORTATION, SCHEDULED | Scheduled Passenger Air Transportation | Airlines | SP | AAL | 682.0 | 481111 | 2030 | 4512 |
| 7 | 1045 | 8 | John W. Bachmann | 39.0 | 24.103 | 0.0 | 0.0 | 0.000 | 17.557 | 80.660 | 2010 | AMERICAN AIRLINES GROUP INC | 02376R10 | NAS | 1 Skyview Drive | Fort Worth | TX | 76155 | AIR TRANSPORTATION, SCHEDULED | Scheduled Passenger Air Transportation | Airlines | SP | AAL | 682.0 | 481111 | 2030 | 4512 |
| 8 | 1045 | 9 | Rajat Kumar Gupta | 38.0 | 24.103 | 0.0 | 0.0 | 0.000 | 8.827 | 70.930 | 2010 | AMERICAN AIRLINES GROUP INC | 02376R10 | NAS | 1 Skyview Drive | Fort Worth | TX | 76155 | AIR TRANSPORTATION, SCHEDULED | Scheduled Passenger Air Transportation | Airlines | SP | AAL | 682.0 | 481111 | 2030 | 4512 |
| 9 | 1045 | 10 | Philip J. Purcell, III | 39.0 | 24.103 | 0.0 | 0.0 | 0.000 | 5.272 | 68.375 | 2010 | AMERICAN AIRLINES GROUP INC | 02376R10 | NAS | 1 Skyview Drive | Fort Worth | TX | 76155 | AIR TRANSPORTATION, SCHEDULED | Scheduled Passenger Air Transportation | Airlines | SP | AAL | 682.0 | 481111 | 2030 | 4512 |
| | GVKEY | DIRNBR | DIRNAME | CASH_FEES | STOCK_AWARDS | OPTION_AWARDS | NONEQ_INCENT | PENSION_CHG | OTHCOMP | TOTAL_SEC | YEAR | CONAME | CUSIP | EXCHANGE | ADDRESS | CITY | STATE | ZIP | SICDESC | NAICSDESC | INDDESC | SPCODE | TICKER | SUB_TELE | NAICS | SPINDEX | SIC |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 49237 | 316056 | 4 | Carla Cico | 140.000 | 100.033 | 0.0 | 0.0 | 0.0 | 0.0 | 240.033 | 2018 | ALLEGION PLC | G0176J10 | NYS | Iveagh Court, Block D, Harcourt Road | Dublin | NaN | D02 V | CUTLERY, HANDTOOLS, AND GENERAL HARDWARE | Hardware Manufacturing | Building Products | SP | ALLE | 353.0 | 332510 | 2010 | 3420 |
| 49238 | 316056 | 5 | Charles L. Szews | 103.846 | 100.033 | 0.0 | 0.0 | 0.0 | 0.0 | 203.879 | 2018 | ALLEGION PLC | G0176J10 | NYS | Iveagh Court, Block D, Harcourt Road | Dublin | NaN | D02 V | CUTLERY, HANDTOOLS, AND GENERAL HARDWARE | Hardware Manufacturing | Building Products | SP | ALLE | 353.0 | 332510 | 2010 | 3420 |
| 49239 | 316056 | 6 | Dean I. Schaffer | 152.000 | 100.033 | 0.0 | 0.0 | 0.0 | 0.0 | 252.033 | 2018 | ALLEGION PLC | G0176J10 | NYS | Iveagh Court, Block D, Harcourt Road | Dublin | NaN | D02 V | CUTLERY, HANDTOOLS, AND GENERAL HARDWARE | Hardware Manufacturing | Building Products | SP | ALLE | 353.0 | 332510 | 2010 | 3420 |
| 49240 | 316056 | 7 | Michael J. Chesser | 15.167 | 0.000 | 0.0 | 0.0 | 0.0 | 0.0 | 15.167 | 2018 | ALLEGION PLC | G0176J10 | NYS | Iveagh Court, Block D, Harcourt Road | Dublin | NaN | D02 V | CUTLERY, HANDTOOLS, AND GENERAL HARDWARE | Hardware Manufacturing | Building Products | SP | ALLE | 353.0 | 332510 | 2010 | 3420 |
| 49241 | 316056 | 1 | Nicole Parent Haughey | 140.000 | 100.067 | 0.0 | 0.0 | 0.0 | 0.0 | 240.067 | 2019 | ALLEGION PLC | G0176J10 | NYS | Iveagh Court, Block D, Harcourt Road | Dublin | NaN | D02 V | CUTLERY, HANDTOOLS, AND GENERAL HARDWARE | Hardware Manufacturing | Building Products | SP | ALLE | 353.0 | 332510 | 2010 | 3420 |
| 49242 | 316056 | 2 | Kirk S. Hachigian | 165.000 | 100.067 | 0.0 | 0.0 | 0.0 | 0.0 | 265.067 | 2019 | ALLEGION PLC | G0176J10 | NYS | Iveagh Court, Block D, Harcourt Road | Dublin | NaN | D02 V | CUTLERY, HANDTOOLS, AND GENERAL HARDWARE | Hardware Manufacturing | Building Products | SP | ALLE | 353.0 | 332510 | 2010 | 3420 |
| 49243 | 316056 | 3 | Martin E. Welch, III | 155.000 | 100.067 | 0.0 | 0.0 | 0.0 | 0.0 | 255.067 | 2019 | ALLEGION PLC | G0176J10 | NYS | Iveagh Court, Block D, Harcourt Road | Dublin | NaN | D02 V | CUTLERY, HANDTOOLS, AND GENERAL HARDWARE | Hardware Manufacturing | Building Products | SP | ALLE | 353.0 | 332510 | 2010 | 3420 |
| 49244 | 316056 | 4 | Carla Cico | 140.000 | 100.067 | 0.0 | 0.0 | 0.0 | 0.0 | 240.067 | 2019 | ALLEGION PLC | G0176J10 | NYS | Iveagh Court, Block D, Harcourt Road | Dublin | NaN | D02 V | CUTLERY, HANDTOOLS, AND GENERAL HARDWARE | Hardware Manufacturing | Building Products | SP | ALLE | 353.0 | 332510 | 2010 | 3420 |
| 49245 | 316056 | 5 | Charles L. Szews | 140.000 | 100.067 | 0.0 | 0.0 | 0.0 | 0.0 | 240.067 | 2019 | ALLEGION PLC | G0176J10 | NYS | Iveagh Court, Block D, Harcourt Road | Dublin | NaN | D02 V | CUTLERY, HANDTOOLS, AND GENERAL HARDWARE | Hardware Manufacturing | Building Products | SP | ALLE | 353.0 | 332510 | 2010 | 3420 |
| 49246 | 316056 | 6 | Dean I. Schaffer | 152.000 | 100.067 | 0.0 | 0.0 | 0.0 | 0.0 | 252.067 | 2019 | ALLEGION PLC | G0176J10 | NYS | Iveagh Court, Block D, Harcourt Road | Dublin | NaN | D02 V | CUTLERY, HANDTOOLS, AND GENERAL HARDWARE | Hardware Manufacturing | Building Products | SP | ALLE | 353.0 | 332510 | 2010 | 3420 |
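In the sample rows shown, TOTAL_SEC equals the sum of the six compensation components (CASH_FEES, STOCK_AWARDS, OPTION_AWARDS, NONEQ_INCENT, PENSION_CHG, OTHCOMP). The Python sketch below checks this for a few rows copied from the two tables above. Whether the identity holds for every row of the dataset is not established by this report, and the tolerance is an assumption to absorb floating-point rounding.

```python
import math

# (row index, six compensation components, reported TOTAL_SEC),
# copied from the first-rows and last-rows tables above.
sample_rows = [
    (0, (37.0, 24.103, 0.0, 0.0, 0.000, 8.349), 69.452),
    (1, (39.0, 18.949, 0.0, 0.0, 14.485, 8.521), 80.955),
    (3, (39.0, 18.949, 0.0, 0.0, 16.300, 3.078), 77.327),
    (49237, (140.000, 100.033, 0.0, 0.0, 0.0, 0.0), 240.033),
    (49240, (15.167, 0.000, 0.0, 0.0, 0.0, 0.0), 15.167),
    (49246, (152.000, 100.067, 0.0, 0.0, 0.0, 0.0), 252.067),
]

TOLERANCE = 1e-6  # assumed absolute tolerance for float rounding

# Collect the indices of rows whose components do not add up to TOTAL_SEC.
mismatches = [
    index
    for index, components, total in sample_rows
    if not math.isclose(sum(components), total, abs_tol=TOLERANCE)
]

print("rows checked:", len(sample_rows))
print("mismatching rows:", mismatches)
```

This kind of additive relationship is one plausible reason TOTAL_SEC correlates with CASH_FEES and STOCK_AWARDS.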